FIGURE 1 



GGGGCTTCGGCGCGAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGT CT GAAGAC TAACAT T T TGTGAAGT TGTAAAACAGAAAACC T GT T AGAAATGTGGT GGT 
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACAT TACTGCAGTAACACTCCACCATATAGACCCGGCTT TACCT TATAT CAGTGACACTGG 
TACAGT AGCT CCAGAAAAAT GC T TAT T T GGGGCAAT GC T AAATAT T GC GG C AG T T T TAT GC AT T G 
CTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAAGAGAACGTTATCATCAAA 
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT 
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT 
TATAT AT GT T TGT TCAGACCAT CC T T T CCTACCAAAT GCAGCCCAAAAT CCATGGCAAACAAGTC 
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC 
ATCAGT TT TGCACAGT GGCAAT T TTGGGAC TGAT T TAGAACAGAAAC T CCAT TGGAACCCCGAGG 
ACAAAGGT T AT GTGC T TCACAT GAT CACT AC TGCAGC AGAAT GGT C TAT GT CAT TTTCCTTCTTT 
GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
TGGATTAACCCTCTATGACACTGCACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCA 
GAGATATTTGATGAAAGGATAAAATATTTCTGTAATGATTATGATTCTCAGGGATTGGGGAAAGG 
T T CACAGAAGTT GCT T AT T CT TC T CTGAAAT TTTCAACCACT TAATCAAGGC TGACAGT AACACT 
GATGAATGCTGATAATCAGGAAACATGAAAGAAGCCATTTGATAGATTATTCTAAAGGATATCAT 
CAAGAAGAC TAT TAAAAACACCTATGCCT ATAC T T T T T TAT CTCAGAAAAT AAAGTCAAAAGACT 
ATG 



FIGURE 2 



<subunit 1 of 1, 266 aa, 1 stop 
<MW: 29766, pi: 8.39, NX{S/T): 0 

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV 
LCIAT I YVRYKQVHALSPEENVI IKLNKAGLVLGILSCLGLS IVANFQKTTLFAAHVSGAVLT FG 
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 

Important features: 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myr i s toyl a t i on s i te s . 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
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FIGURE a 



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACT TCGCC ATGA GT TT CC TC AT CGACT CCAGCAT CATGATTACCTCCCAGATACT ATTT TTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTT CATGGTGCCT T TTTACATTGGCTAT TT TAT TGTGAGCAATATC CGACTACTGCATAAACAACGA 
jM, CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC 
fj! CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG 
U TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC 

m 

f ; C T CAGGAAT GT GACT GACAC GGAT AT T C T AGC C CT G GAAC GGC GACT GC TG C AAAC CAT G GAT AT GAT 

jLjif 

1 % CATAAGCAAAAAGAAAAGGAT GGCAAT GGCACGGAGAACAATGT TCCAGAAGGGGGAAGT GCATAACA 
AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT 
CTTATTCAACAGGAAGTGGAT GCT T TGGAAGAATTAAGCAGGCAGCT TTTT CT GGAAACAGCT GATCT 
ATAT GCTACCAAGGAGAGAATAGAATACT CCAAAACCTTCAAGGGGAAATATTTTAATTT TCTT GGT T 
ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
GGGAAAAC GGAT C C T GT CAC AAG AG GC AT T GAGAT C ACTG T GAAT TAT C T GGGAAT C CAAT TT GAT GT 

CI 

GAAGT TTTGGT CCCAACACATTT C CTT CATTC T TGT TGGAATAATCATC GT CACATCCAT CAGAGGAT 

2 TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
Mi CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT 

AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG 
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
-GAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA 
GATATAAGAGGGGGGAAAAAT GGAACCAGGGCCTGACATT T TATAAACAAACAAAAT GCT ATGGTAGC 
ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 
AACAT GAGAGGGAGAAC T AAC T CAAGAC AATACT C AGCAGAGAGCAT C C CGT GT GGAT AT GAGG CT GG 
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
C TAT GGT AGCT GAGC C AAAC ACG T AGGAT TTCCGTTT TAAG GT T CAC AT GGAAAAG GT T AT AGC TTTG 
C CT T GAGAT TGAC T CAT T AAAAT CAGAGAC T GT AACAAAAAAAAAAAAAAAAAAAAAGGG C GGC CGC G 
AC T CT AGAGTC GACC T GCAGAAGCT T GGC C GC C AT GG CCCAAC T T GT T TAT TG C AGC T T AT AAT G 



II! 
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FIGURE 4 

MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYWQVIFSVTFAFSCTMFELIIFEILGV 
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP 
FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAWCPYTYMSYFLROTTDTDILALERRLLQ 
TMDMIISKKKRMAMARRTMFQKGETO 

LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI 
TWYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY 
FVS S VLL I RMSMPLE YRT 1 1 TEVLGE LQFNFYHRWFDVI FLVS ALS S I L FL YLAHKQAPEKQMAP 

Important features: 
Signal peptide: 

amino acids 1-23 

Potential transmembrane domains: 

amino acids 37-55, 81-102, 150-168, 288-311, 338-356, 375-398, 
425-444 

N-glycosylation sites. 

amino acids 67-70, 180-183 and 243-246 

Eukaryotic cobal ami n -binding proteins 

amino acids 151-160 
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FIGURE s 



AGCAGGGAAATCCGGAT GT CTCGGT TATGAAGT GGAGCAGT GAG T GTGAGCC T CAACAT AGT T CC 
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
G T CAT GCAGAAT TAT GGGGATCACCCT T GT GAGCAAAAAG GC GAACCAGCAGC T GAATT TCACAG 
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGT T TGCAGC CTAT T GT TACAACT CATC T GAT ACT T GGAC T AAC T CGTGCAT TCCAGAA 
AT TATCACCACCAAAGATCCCATATTCAACACTCAAACTGCAACACAAACAACAGAATTTAT TGT 
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
C TGCTCCAGCT TCCACT TCTAT TCCACGGAGAAAAAAAT TGATT TGTGTCACAGAAGTT T TT ATG 
G AAAC T AGC ACC AT G T C T AC AGAAAC T GAAC CAT T T G T T G AAAAT AAAG C AG CAT T C AAG AAT GA 
AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACAAGAAT 
CAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA 
TGAGGAATCAAAGAAAACT GATAAAAACCCAGAAGAGT CCAAGAG TCCAAGCAAAAC TACCGT GC 
GATGCCTGGAAGC TGAAGT T TAGAT GAGACAGAAAT GAGGAGACACACCT GAGGCT GGT T TCT T T 
CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
CCCTTGGTTCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAAT 
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
T T C TAGCCT GGC TATGT CCTAATAATAT CCCACT GGGAGAAAGGAGT T T T GCAAAGT GCAAGGAC 
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGTCACTGAGACCAAGGCTT TC TC TACT GAT T C CGCAGC T CAGACCC T T T CT T CA 
GC T CT GAAAGAGAAACACGTATCCCACCT GACAT GT CC T T CT GAGCCCGGTAAGAGCAAAAGAAT 
GGCAGAAAAGT T TAGCCCCTGAAAGCCAT GGAGAT TCT CAT AAC T TGAGACC TAATC TCT GT AAA 
GC TAAAATAAAGAAATAGAACAAGGC T GAGGATACGACAGT AC AC TGTCAGCAGGGACT GTAAAC 
ACAGACAGGGTCAAAGTGTTT TCTCTGAACACATTGAGT TGGAATCACT GTT TAGAACACACACA 
C T TACT T T T TC TGGTC T C TACCAC TGCT GAT AT TT TCTC TAGGAAATATACT TT TACAAGTAACA 
AAAAT AAAAAC TCT T AT AAAT T T C TAT T T T TAT C T GAG T T AC AG AAAT GAT T AC T AAGGAAG AT T 
ACTCAGTAATT TGT T TAAAAAGT AAT AAAAT TC AAC AAAC AT TT GC TGAATAGCTACTATATGT C 
AAG TGCT GT GCAAGGTAT TACAC TC T GT AATT GAATAT TAT T CCTCAAAAAAT TGCACATAGTAG 
AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 
T T T TAT T T T T GC TGAGACTAAT CT TAT T CAT T TT C T CTAATATGGCAACCAT TATAACCT TAAT T 
TAT TAT TAACATACC T AAGAAGTACAT T G TTACCTC TATATACCAAAGCACAT T T TAAAAGTGCC 
AT TAACAAATGTATCACTAGCCCTCCT TT TTCCAACAAGAAGGGACTGAGAGATGCAGAAATAT T 
TGTGACAAAAAAT TAAAGCAT T TAGAAAACT T 



FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
L S LAGKDQVE T ALKAS FE T G S YGWVGDG FWI S R I S PNPKCGKNGVG VL I WKVP VS RQ FAAYC YN 
S SDTWTNSCI PE 1 1 TTKDP I FNTQTATQTTEFI VSDSTYSVAS PYST I PAPTTTPPAPASTS I PR 
RKKL I C VTEVFMET S TMS TE TE P FVENKAAFB^ 

RYVKAFP FTNKNQQKEMIE TKVVKEEKANDSNPNEESKKT.DKNPEE S KS PS KT T VRCLE AE V 



Signal sequence: 

amino acids 1-16 



HI Transmembrane domain: 

W amino acids 235-254 



N-glycosylation site. 

amino acids 53-57, 130-134, 289-293 

111 Casein kinase II phosphorylation site. 

amino acids 145-149, 214-218 

w 

ff:| Tyrosine kinase phosphorylation site. 

amino acids 79-88 



N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
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FIGURE 7 



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTC T CAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGAT GT T CCGC GAGGT TGAGGAAC TGAT 
GGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTA 
AAGCATCAT CAGAAGTGAACCTGGCAAACT TACCTCCCAGCTATCACAATGAGACCAACACAGAC 
ACGAAGGT T GGAAATAATACCAT CCAT GTGCACCGAGAAAT TCACAAGATAACCAACAACCAGAC 
T GGACAAAT GGTC TT T T CAGAGACAGT TAT CACATCT GTGGGAGACGAAGAAGGCAGAAGGAGCC 
jM* ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC 
gi ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
m GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
5 ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC 
B ' l f. CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG 
t-' ;?j5 GAGAT CC T GCTGCCCAGAGAGGTCCCC GAT GAG TATGAAGT T GGCAGC T TCATGGAGGAGGT G CG 

4* C C AGGAGC T G GAGGAC C T G GAGAGGAG C C T GAC T GAAGAGAT GGC GC T G GGG GAGC CTGCGGCTG 

W C CGCCGCTGCAC TGCT GGGAGGGGAAGAGAT T TAGA TC TGGACCAGGCT GT GGGTAGAT GTGCAA 

gj TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGGTGACCAGGCTTCTTCCTA 
If! CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT 
fi CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA 
T£ GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG 
J{ TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC 
W TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA 
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT 
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA 
TACAGGTTAACCTGCAGAAACAGTACTTAGGTAATTGTAGGGCGAGGATTATAAATGAAATTTGC 
AAAAT CACT TAGCAGCAAC T GAAGACAAT TATCAACCACGT GGAGAAAATCAAACCGAGCAGGGC 
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGT.TTAAAGTTGCA 
CATGAT TGTATAAGCATGCTTTCTTTGAGTT TTAAATTATGTATAAACATAAGT TGCATT TAGAA 
ATCAAGCATAAATCACTTCAACTGCAAAAAAAAAAAAAAAAAAAAAAAAAA 



n 



APP_ID= 
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FIGURE 8 

MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFRSVEELMEDTQHKL 
RSAVEEMEAEEAAAKASSEVNLANL P PS YHNETNTDTKVGNNT I HVHRE IHKI TNNQTGQMVFS E 
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC 
T KMATRGSNGT I CDNQRDCQPGLCCAFQRGLL FPVC T PLPVEGEL.CHDPAS RLLDL I TOELE PDG 
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE 
RSLTEEMALGE PAAAAAALLGGEE I 

Signal sequence: 

amino acids 1-19 

N-glycosylation site. 

amino acids 96-100, 106-110, 121-125, 204-208 
Casein kinase II phosphorylation site. 

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 
327-331 

N-myristoylation site. 

amino acids 202-208, 217-223 

Amidation site. 

amino acids 140-144 
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FIGURE Q 



CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC 
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAG TATATT GACCACAAGCT CAGAGGAGT GCAGAACC GT GT AGCCCGC TGCATCATGT GCT G TT T 
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA 
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC 



m ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT 
Id GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG 



ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT 



GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT 
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA 
AGAT T CTGGGCAAGAAGAACGAGGCGCCC CCGGACAACAAGAAGAGGAAGAAGTGACAGCT CCGG 
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT 
CTCCATTTTGTGGTAAAT^AAGGTTTTAGGCCAGGCGCCGTGGCTCAGGCCTGTAATCCAACACT 
TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 
AAGATT TTAT TAAAGATAT T T TGT TAAC TC 
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FIGURE 10 



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGL I QRSVFNLQI YGVLGLFWTL 
NWVLALGQC VLAGAFAS FYWAFHKPQD I PT FPL I S AFI RT LRYHTGS LAFGAL I LTLVQ IARVI L 
E Y I DHKLRGVQNP VARC IMCC FKCCLWCLEKFI KFLNRNAY IM I AI YGKNFCVS AKNAFMLLMRN 
IVRWVLDKVTDLLLFFGKLLWGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY 
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 

Important features; 

!p " Transmembrane domains: 

-pi 

Si! amino acids 57-80 (type II), 110-126, 215-231, 254-274 

y| ■ N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

?» 

Hypothetical YBR002c family proteins. 

Q amino acids 276-288 

I 



Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 



Amidation site. 

amino acids 306-310 
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FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATCGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTCAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
jpfe ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CI CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT 
GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCGTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC 
/I TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA 
•%l CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC 
jU CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
4» TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
^ GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
y TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
2L! AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT 
f|I GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC 
r! AC TTCTGCCTGGTGCTGGCCT CAC T GCAC GT CAT GAT GAC GCT CACC AACT GGTACAAGC C C G G TGAG 

f|| ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA 
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FIGURE 12 



MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWVCEEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ 
NGFWFFKFLILVGLTVGAFYIPDGSFTNIWETFGWGSFLFILIQLVLLIDFAHSWNQRWLGKAE 
ECDSRAWYAGLFFFTLLFYLLS IAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVS IAAVLPKV 
QDAQPNS GLLQAS VI TLYTM FVT WSALS S I PEQKCNPHLPT QLGNE T WAGPEG YE TQWWDAPS I 
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY 
SFFHFCLVLASLHVMMTLTNWYKPGETRKMISTWTAVWVKICASW 



H FS 

fat* 

0 
m 



Signal sequence : 

amino acids 1-20 



P' 5 Transmembrane domains: 

* amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 

jwj 272-283, 324-340, 391-406, 428-444 

ill 
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FIGURE 13 

CGGGCCAGCCTGGGGCGGCCGGCCAGGAAC'CACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGT TGGAAAAAGAC T CCT GT ARC CCT CCT CCAGG ATGA ACCACC T GC CAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC 
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAATGGAGGCATTGAGAACACATTAGAGAAGGAGGT GATGCAGTATGACTACTAT TCT TCATAT 
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG 
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
C TTGC CTGGAT T GAGACGT GGT T CC TGGAT T TCAAAGT GT T ACC T CAAGAAGCAGAAGAAGAAAA 
CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG 
GTCAGT T T TAT TCCCC T CCT GAAT CCGAAGC AGGATC T GAAGAAGC TGAAGAAAAACAGGAC AG T 
GAGAAACCACT TT TAGAACTATGAGTAC T AC TT T TGT TAAAT GTGAAAAACCC TCACAGAAAGTC 
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC 
CACTGCTGGCTTTATTGAACAGCTAATAAAGATTTATTTATTGTAATACCTCACAAACGTTGTAC 
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGACTGAGCC TGAT GTG T TAACAAATAGGTGAAGAAAGTC T T GTGCTGTAT TCC T AATC 
AAAAGACT TAAT ATAT T GAAGTAACAC T TT T T TAGTAAGCAAGATACCT T T TTAT TTCAATT CAC 
AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CTTGCAAAAGGGGAAGAAAGGAATTGCGAATACATGTAAAATGTCACCAGACATTTGTATTATTT 
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGACT TAAACCAT TCATATCATGTT TCCTTTGCGT TCAGCCAATTTCAAT TAAAATGAA 
CTAAAT TAAAAA 
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FIGURE id 



MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTF'DLLF 
VTLLW I IE LNVNGG I ENTLEKEVMQYDYYS S YFD I FLLAVFRFKVL I LAYAVCRLRHWWAI ALTT 
AVTSAFLIAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER 
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide: 

H amino acids 1-20 

C! 

m Transmembrane domains: 

y •■■ 

||| amino acids 54-72, 100-118, 130-144, 146-166 

\l N-myristoylation sites. 

amino acids 14-20 , 78-84, 79-85, 202-208, 217-223 

s 



m 

r\ 

rii 
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FIGURE 15 



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 

CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGAAGATCTGCTCCAG 

GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCGCTGGGGCCTGGGGTGCAGGGCTGCCCAT 

CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 

CGAGAC GT GCGAC CCGACACGGT GGGGCTG TACG TCT T T GAGAAC GGCATCACCAT GC T CGACGC 

AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 

TGCCCAGCGGGGTCTTCCAGCCAGTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 

CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 

CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 

AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC 

CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT 

GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC 

TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG 

CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC 

TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC 

TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC 

TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA 

CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 

CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT 

TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC 

CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGT.CCACCTGCCTCA 

ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 

GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG 

GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC 

AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG 

GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA 

GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG 

GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC 

CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 

GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 

ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 

CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 

ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATC TAAG CCA 

GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 

ACACCACGTAAGTTCTCAGTCCCAAGCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 

GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 

CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 

CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 

TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 

GGC T GT G T GAC T C T AGT C TT GGCC CCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGC 

TTTAGGAACATGTT TT GCTT TTTTAAAATATATATATT TATAAGAGATCCTTTCCCATT TATTCT 

GGGAAGATGTT TTTCAAACT CAGAGACAAGGACTTTGGTTTT TGTAAGACAAACGATGATATGAA 

GGC C T TT TGTAAGAAAAAATAAAAGAT Gi\AGT GTGAAA 



FIGURE 16 

MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLIALEPGILDTANVE 
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL 
AGIAALQELDVSNLSLQALPGDLSGLFPRLRLIAAA^ 

TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPWKEPTALSSSLAPTWLSPTAPATEAP 
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
M VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
tl TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSJST^ 
ii LLAAIAAVGAAYCTORGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPPCATEGG 

y ? 

y CEVPLMGFPGPGLQS PLHAKPYI 

SJ 

H Important features: 

p. 

^ Signal peptide: 

$ 

amino acids 1-23 

HI Transmembrane domain: 

Q amino acids 579-599 

W EGF-like domain cysteine pattern signature/ 

r T,? amino acids 430-4 42 

ni 

Leucine zipper pattern. 

amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
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FIGURE 17 



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTC ATG CG 
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCG'GTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
AT GAAGAAGGCAGCCAGGATGAATCC T TAGAT TCCAAGAC T AC T T TGACAT CAGAT GAGT CAGTA 
AAGGAC CAT AC TAC TGC AGGCAGAG T AGT T GC TGGT C AAAT AT T T C T T GAT T CAGAAGAAT C T GA 
AT TAGAATCCTCTAT TCAAGAAGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAG 
AAGATATCAGCTTTCTAGAGTCTCCAAATCCAGAAAACAAGGACTATGAAGAGCCAAAGAAAGTA 
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT 
N T T T CCTAGATAAGGAGTAT GATGAAT GT ACAT CAGAT GGGAGGGAAGATGGCAGAC TGTGGT GTG 

5f CTACAACCT ATGACTACAAAGCAGAT GAAAAGTGGGGC T T T T GT GAAAC T GAAGAAGAGGCT GCT 

m AAGAGAC GGCAGATGCAGGAAGCAGAAATGAT GT ATCAAACT GGAATG AAAAT C C T TAAT GG AAG 

yi CAAT AAGAAAAGCCAAAAAAGAGAAGCATAT CGGTAT CTCCAAAAGGCAGCAAGCATGAACCATA 

SJ CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATCCAG 

'N 5 GCAGCGAGAGAGATGT TTGAGAAGCTGACT GAGGAAGGCTCTCCCAAGGGACAGACTGCTCT T GG 

it? 

CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
TT GGAGCT C T TGGGGGCAATC TAATAGCCCACAT GGT T TT GGTAAGTAGAC T T TAGT GGAAGGC T 
||| AATAAT AT TAAC AT CAGAAGAAT T TGTGGT T TAT AGCGGCCACAAC T T T T TCAGC T T TCATGATC 

Q CAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT 

W AACACATGGAATCTACATGTAAATGAAAGTTGGTGGAGTCCACAAT TT TTCTT TAAAATGAT TAG 

p| 

£\ TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG 

I %$ 

GAAT TGTCAGAATCAT TT TT TACAT TAGAT TATCATAATT T T AAAAAT T T T T C T T TAGT T TT TCA 
AAAT TT TGT AAATGGT GGCTATAGAAAAACAACATGAAATAT TATACAATAT T TT GCAACAATGC 
CC TAAGAAT TGT TAAAAT T CATGGAGT T AT T TGT GCAGAATGAC TCCAGAGAGCTC TAC T TT CTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGATTAAC TCAT T TT TAAT AAAAT TAT GT C TAAGATTAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA&AAAAAAAAAA 
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FIGURE 18 



m 



MRVRIGLTLLLCAVLLSIASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRWAGQIFLDSEESEL 
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK 
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAK^ 

EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN 
S S QAKALV Y YT FGALGGNL IAHMVLVS RL 

Important features: 
Signal peptide: 

amino acids 1-21 



I? f N-glycosylation sites. 

W amino acids 195-199, 217-221, 272-276 



Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 



Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen -binding domain protein. 

amino acids 127-169 



Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
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FIGURE 1Q 



AAT T C AGAT T T T AAGCC C AT T C T G C AG T G GAAT T T CAT GAAC T AGC AAG AGGAC AC CAT C T T C T T 
GTATTATACAAGAAAGGAGT GTACC TAT CACACACAGGGGGAAAAATGCT CT T TTGGGT GC TAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG 
ATAAGTACATTTTTATCACTGGATGTGACTCGGGCTTTGGAAACTTGGCAGCCAGAACTTTTGAT 
AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 
AAC C T CAGAGAGAC T TCGT ACTGTGC T TCT GGATGTGACCGACCCAGAGAATGTCAAGAGGAC T G 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
y 5 CC C GGCGTGC TGGC TCCCACTGAC TGGC TGACAC TAGAGGACT ACAGAGAAC C TAT TGAAGTGAA 

O C C TGTT T GGACTCATCAGT GT GACACTAAATATGC T TCCTTT GGT CAAGAAAGC TCAAGGGAGAG 

W TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 

m 

« , TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC 

•Stir 

% I ATGCAT TGAACCAGGAT TGTTCAAAACAAACT TGGCAGATCCAGTAAAGGTAATTGAAAAAAAAC 

!4 TC GCCATT T GGGAGCAGCTGTCT CCAGACAT CAAACAACAAT ATGGAGAAGGT TACATT GAAAAA 

4* AGTC TAGACAAAC T GAAAGGCAATAAAT CCTATGT GAACAT GGACC TCT C T CCGGT GGTAGAGTG 

^ CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGJ\AAAGATGCCAAAA 

m TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 

.%M 

r ?J * GCAGAGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA 
W 

i| AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT 
Gj CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT 
p ^ CCCTGCTGAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATCTCTTATCTAAATATTAAAAGATAAGTCAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAA 
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FIGURE 20 

MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKAETSERLRTVLLDVTDPEWKRTAQW^ 

REPIEVNLFGLISVTLmLPLVKKAQGRVINVSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK 
AFGVHVSC I E PGL FKTNLADPVKVI EKKLAI WEQL S PD I KQQ YGEG YI EKS LDKLKGNKS YVNMD 
LSPWECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELANPK^ 



Important features of the protein: 



* Signal peptide: 

ff«S 

$xw amino acids 1-17 



iy 



Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 



W Glycosaminoglycan attachment site. 



amino acids 39-42 
N-myristoylation sites. 

amino acids 36-41, 42-47, 108-113, 166-171, 198-203 and 207-212 
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FIGURE 21 

CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CAC TC G C T T T CCAGCAC C T C AAC ACGGACT C GGAC AC GGAAGG TTTTCTTCTT GGGGAAG T AAAA 
GG TGAAGC CAAGAACAGCAT TAC T GAT T CCCAAATGGATGATGT TGAAGTT GT TTATACAAT T GA 
CAT TCAGAAATATAT T C CATGC TATCAGCT T TT TAGC T T T TATAAT TCT TCAGGC GAAG T AAATG 
AGCAAGCAC TGAAGAAAATAT TAT CAAAT G T CAAAAAGAAT GTGGTAGGTT GG TACAAAT T CCGT 
CGTCATTCAGATCAGAT CATGACGT T TAGAGAGAGGC T GCT T CACAAAAAC T TGCAGGAGCAT T T 
T TCAAACCAAGACCT TGTTTTTCTGC TAT TAACACCAAGTATAATAACAGAAAGC T GCT CT AC TC 
ATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTTTTTCACAGGGTACCTTTAGTGGTT 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
T TT TAGCCGAGCAGTACAAACACACAGCTCTAAAT TT TTTGAAGAAGATGGATCC TTAAAGGAGG 
TACATAAGATAAATGAAAT GTAT GC T T CAT TACAAGAGGAAT TAAAGAG TATATGCAAAAAAG TG 
GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGATTAAAACGAGAAATTGA 
GAAAAGGAGAGGAGCACAGAT TCAGGCAGCAAGAGAGAAGAACAT CCAAAAAGAC CC TCAGGAGA 
ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
ATGTCTTTAAAAAATAGACATGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGT 
AGACAATCTGACCTTAATGGTAGAACACACTGACATTCCTGAAGCTAGTCCAGCTAGTACACCAC 
AAATC AT T AAGC AT AAAGCCT T AGAC T T AGATGACAGATGGCAAT T CAAGAGATC TCGG T T GT T A 
GATACACAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAAT 
GAGCAGCCCAGAAACAGAT GAAGAAAT T GAAAAGAT GAAGGGT T T TGG TGAATAT TCAC GGTCTC 
C TACAT TT TGAT CC T T T TAACC TTACAAGGAGAT T T T T TTAT TT GGC T GATGGGTAAAGCCAAAC 
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TG TT TGCAGTAATACACAGATAAC TC T TAGT GCATT TACT T CACAAAGTACT TT T T CAAACATCA 
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAATCCCAGCACT 
TAGGGAAGACAAGTCAGGAGGAT TGAT TGAAGCTAGGAGTTAGAGACCAGCCTGGGCAACGTATT 
GAGACCATGTCTATTAAAAAATAAAATGGAAAAGCAAGAATAGCC TTAT TTTCAAAATATGGAAA 
GAAATTTATATGAAAATTTATCTGAGTCATTAAAATTCTCCTTAAGTGATACTTTTTTAGAAGTA 
CATTATGGC T AGAGT T GCCAGATAAAAT GCT GGATAT CAT GCAATAAAT TT GCAAAACATCATC T 
AAAATTTAAAAAAAAAAAAAAAAAAAAAA . 
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FIGURE 22 



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFS FYNS SGEVNEQALKKI L SNVKKNWGWYKFRRHS DQ IMT FRERLLHKNLQEHFSMQDL 
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLGMSEQLGYKTVSGSCMSTGFSRAV 
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA 
QI QAAREKN I QKDPQENI FLCQALRT FFPNSE FLHS CVMS LKNRHVSKS S CNYNHHLDWDNLTL 
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEEIEKMKGFGEYSRSPTF 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites. 

amino acids 75-79, 322-326 

N-myristoylation site, 

amino acids 184-154 

Growth factor and cytokines receptors family. 

amino acids 134-150 
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FIGURE 23 



GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCCTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCG CCGCGCT GCT GGCT GC CGAGCT CAAGT CGAAAAGTT GCTCGGAAGT GCGAC GT CT TTACGT GT CCAAAGGCTT C 
AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 
CAAGAGATGGAGGAGAAGTACAGC CT GCAAAGTAAAGAT GATTTCAAAAGT GTGGT CAGC GAACAGT GCAAT CATTT G 
W CAAGCTGTCTTTGCTTCACGTTACAAGAAGTTTGATGAATTCTTCAAAGAACTACTTGAAAATGCAGAGAAATCCCTG 
t? AATGATATGTTTGTGAAGACATATGGCCATTTATACATGCAAAATTCTGAGCTATTTAAAGATCTCTTCGTAGAGTTG 

fin 

f I AAACGTTACTAC GT GGT GGGAAAT GT GAACCT GGAAGAAAT GCTAAATGACTTCT GGGCT CGCCT CCTGGAGCGGAT G 

W TT CCGC C TGGT GAACT C CCAGTAC CACTTTACAGAT GAGTAT CT GGAAT GTGT GAGCAAGTATAC GGAG CAGCT GAAG 

M 

■ ■* CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TTAGCGGTTGCGGGAGATGTCGTGAGCAAGGTCTCCGTGGTAAACCCCACAGCCCAGTGTACCCATGCCCTGTTGAAG 
# ;: AT GAT CTACTGCTCC CACT GCC GGGGT CT CGTGACT GTGAAGC CAT GT TACAACTACT GCT CAAACATCAT GAGAGGC 

8 T GTTT GGCCAAC CAAGGGGAT CT C GATTT T GAAT GGAACAATTT CAT AGATGCTAT GCT GAT GGT GGCAGAGAGGCTA 

M GAGGGTCCTTTCAACATTGAATCGGTCATGGATCCCATCGATGTGAAGATTTCTGATGCTATTATGAACATGCAGGAT 

Vfi 

*? s - AATAGTGTTCAAGTGTCTCAGAAGGTTTTCCAGGGATGTGGACCCCCCAAGCCCCTCCCAGCTGGACGAATTTCTCGT 
TCCAT CTCT GAAAGT GCCTT CAGT GCT CGCTT CAGAC CACAT CACCC C GAGGAACGC CCAAC CAC AGCAGCT GGCACT 
&f AGTTTGGACCGACTGGTTACTGATGTCAAGGAGAAACTGAAACAGGCCAAGAAATTCTGGTCCTCCCTTCCGAGCAAC 

M GT TT GCAACGAT GAGAGGAT GG C T GCAGGAAACGGCAAT GAGGAT GACT GTTGGAAT GGGAAAGGCAAAAGCAGGTAC 

f S 5 

M CT GT TTGCAGTGACAGGAAATG GATT AGCCAACCAGGGCAACAACCCAGAGGT CCAGGT T GACACCAGCAAACCAGAC 

ATACT GATCCTT C GT CAAAT CAT GGCT CTT CGAGT GATGACCAGCAAGAT GAAGAATGCATACAAT GGGAAC GACGT G 
GACT T CTTT GATATCAGT GAT GAAAGTAGT GGAGAAGGAAGT GGAAGT GGCT GT GAGTAT CAGCAGT GCCCTTCAGAG 
TTTGACTACAAT GC CACT GACCATGCT GGGAAGAGTGCCAAT GAGAAAGC CGACAGTGCT GGT GT CCGT CCT GGGGCA 
CAGGCCTACCTC CT CACT GT CTT CT GCAT CTT GTT CCTG GTT AT GCAGAGAGAGTGGAG AT&AT T CT CAAACT CT GAG 
AAAAAGT GT T CAT CAAAAAGTTAAAAGGCACCAGTTAT CACTT T T CTACCAT CCTAGT GACTTT G CTT TTT AAAT GAA 
TGGACAACAATGTACAGTTTTTACTATGTGGCCACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGG 
AGGAAAAGGGACT GT GCAT T GAGTTGGTT C CTGCT CCCC CAAACCAT GT TAAACGTGGCTAACAGTGTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TT TTTTT C CAACTGT GAT CT CGCCTTGT TT CTTACAAGCAAAC CAGGGT CCCTT CTTGGCACGT AACAT GT ACGTAT T 
TCTGAAATATTAAATAGCT GTACAGAAGCAGGT TTT ATTT AT CAT GT TAT CT TATTAAAAGAAAAAGCCCAAAAAGC 
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FIGURE 24 

MARFGLPALLCTLAVLSAALIJU^ELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST 

CCSQEMEEKYSLQSKDDFKSWSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH 

LYMQNSELFKDLFVELKRYYWGNWLEmLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE 

QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDWSKVSWNPTAQCTHALLKMIYCSHCRGL 

VTWPCYNYCSNIMRGCLANQGDLDFEWNNFIDAMLMVAERLEGPFNIESVM 

MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 

EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS 

KPDILILRQ IMALRVMT SKMKNAYNGNDVD FFDISDESSGEGSGSGCEYQQCPSE FD YNAT DHAG 

KSANEKADSAGVRPGAQAYLLTVFCI LFLVMQREWR 

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A <P-loop) . 

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537 
Glypicans proteins. 

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
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FIGURE 25 



CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
C TGAGTCAT CCCCAGGGAT CAGGAGC CT CCAGCAGGGAACC T TCCATTAT AT TCT T CAAGCAAC T 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTG TAG GAGCTCTG 
H AGCGCCCACTCTTCCAATTAAACATTCTCAGCCAAGAAGACAGTGAGCACACCTACCAGACACTC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 

ri 

JJJ TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
y GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
%| AGCTAGT GT CATT TAACCTTAAAT GCAAT CAGGAAAGTAGCAAACAGAAG TCAATAAAT AT TT T T 

N; AAAT GT CAAAAAAAAAAAAAAAAAA 

Si 

Q 

w 
cs 

m 
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FIGURE 26 



MKVXISSLLLLLPLMIMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLl^PRRKFM 
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 



8^ 



Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 
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FIGURE 27 



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 

AGCTGGTCTGCCATGGACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 

CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 

TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 

ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 

CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 

AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 

GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG 

CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC 

TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC 

GAGCCCACC T GGAAACACATT GGGGAT GGC T GC TGCC TCAC CAGAGAGACCTGGAAGGAT CT TGA 

|y GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG 

>J GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC 

^ TT CCC CAGCC TCCAAT TAGAACAAGCCACCCACCAGCCTAT CTATC T T C CAC T GAGAGGGACC TA 

— 

GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC 
m AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG 
Iff AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC 



CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC 
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC 
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGACT CAAAGAGGCAGAGGT TT TGT TC TCAAATAT TT T T TAATAAATAGACGAA 
ACCACG 
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FIGURE 28 



MDILVPLLQLLVLLLTLPLHimLLGCWQPLCKSYFPYLMAVLTPKSNRKMSSKKRELFSQIKGL 
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSLIAENRHLQYERFWAPGEDM 
RQLADGSMDVWCTLVLCS VQS PRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAEMWQQVFE PTW 
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL 
QLEQATHQPIYLPLRGT 



w 



Important features: 
Signal peptide: 

amino acids 1-23 

Leucine zipper pattern . 

amino acids 10-32 



^ N-myristoylation sites. 

^ amino acids 64-70, 78-84, 80-86, 91-97, 201-207 
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FIGURE 2Q 

CAATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTT GTT TATGATTC TTAT AC T AAT T TATACAAAGATAT TAAGGCCC T GT TCAT TAAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGT TAAAAGCTGAAAAAA^AAAAAAAAAAAAAAAAAAA 
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FIGURE so 

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLP'HRPAMTCSQAQPRG 
EGEKVGDG 

Important features: 
Signal peptide: 

amino acids 1-15 



PI 



|4 



Growth factor and cytokines receptors family: 



jh* amino acids 3-18 
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FIGURE 21 



W 



fil 



GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT 
CC TCCAAGCAAGTCAT T T CC C T TAT T T AACCGAT G TGT C CCT CAAACACCTGAGTGCTAC T CCC T 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATC ATG TCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGCTCGTC T TGAT TTTTGTTCT CAGAAAGAGAAT AAAAT T GACAGT TGAGCT T TTCCAAAT CAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG 
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC 
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG 
GATTCCGAGAATCAT TGT CATGTACATGCAAAAC GCACT GAAAGAACAGCAGCATGGTGCAT TGT 
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC 
AAC C AG AAT G CAT AT AC T AC AAC T GC T AT T AAT G GG AC AGAT T T C T G T AC AT CAGC AAAAG AT GC 
ATTCAAAATCT TGTCCAAGAACTCAAGTCAC TT TACATC TATTAACTGCTTTGGAGACTT CATAA 
TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTG7VAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
TGGAAACAAAT GATGGAT CGTCAGAAAAGCCC T ACT T TAT GGATCAAGAAT T TCTGAGT T TCGTA 
AAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGAATGAGGA 
GGGAACAGAACTCCAGGCCATTGTGAG ATAGA TACCCAT T TAGGTATCT GTACCTGGAAAACAT T 
TCC T TC TAAGAGCCAT T TACAGAATAGAAGATGAGACCAC T AGAGAAAAGT TAGTGAAT TT TT TT 
T T AAAAG AC C T AAT AAAC C C TAT T C T T C C T C AAAA 
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FIGURE 32 

MS GRDT I LGLC ILALALS LAMMFTFRFI TTLLVHI FI S LVI LGLL FVCGVLWWLYYDYTNDLS IE 
LDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRIKLTVELFQITNKAISSAPFLLFQPLWTFA 
ILIFFWVLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA 
GAWTCYFNRSKNDPPDHPILSSLSILFFYHQGTVVKGSFLISVVRIPRI IVMYMQNALKEQQHG 
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD 
FI I FLGKVLWC FT VFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHS FLSVFETVLDAL FLC FA 
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR 

Important features: 
Signal peptide: 

amino acids 1-20 

Putative transmembrane domains: 

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371 

N-glycosylation sites. 

amino acids 204-208, 295-299, 313-317 

N-myristoylation sites. 

amino acids 147-153, 178-184, 196-202, 296-275, 342-348 
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FIGURE 33 



GT TCGAT TAGC TCCTCT GAGAAGAAGAGAAAAGGT TCT T GGACC T CTCCCTGTTTCTTCCT TAGA 
ATAATTTGTATGGGATTTGTGATGCAGGAAAGCCTAAGGGAAAAAGAATATTCATTCTGTGTGGT 
GAAAAT T T T T T GAAAAAAAAAT T GC G T T C T T C AAAC AAG GG T G T CAT TCT GAT AT T T ATGA GG AC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
AT T CAAACAAAGAAACGGCAAAGAAGAT T AAAAGGC CCAAGT T CAC TGT GCCT CAGATCAACT GC 
GATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATGTCCAGCAGGATGCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGT TACAAAGGGAGT TAT T CCAACGGT GT CCAATCGTT AT CCC TAC CACGAT GGAGAGAAT CCT T 
TATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCATCAGCTCTTACATACTCATCAT 
CGAAAAGT CCAGC T GCCCAAGCAGGTGAGACCACAAAAGCC TATCAGAGGCCACCTAT T CCAGGG 
ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC 
f*fe CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG 
fi GCCACAGGAGCCAGGAGATGGATCTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCC 
35 AGAGCTGATCCAGGTATCCAAAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGC 
JJ GGATGT CAGCCTGGGACTTGTTCCAAAAGAAGAAT TGAGCACACAGTCT TTGGAGCCAGTATCCC 

IP T GGGAGAT C CAAAC T GCAAAA.T T GAC TTGTCGTTTT T AAT T GAT GGGAGCACC AGC AT T G GCAAA 

|d CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC 
%l CGG TCCACTGATGGGTGT TGTCCAGTAT GGAGACAACCCT GC TACTCACT T TAACC TCAAGACAC 

y ACACGAATTC T CGAGAT CTGAAGACAGCCATAGAGAAAAT TACTCAGAGAGGAGGAC TT T C TAAT 

^ GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
*N GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
f. GAC T TGC GAGAGAG T C AGGAAT CAACAT T T T C T TCATCAC CAT T GAAGG T G C T GC T GAAAAT GAG 

| : 3 AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
jm GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
j£ ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
Hi AGAGTT T GAGAT T TCCGACACGGACAC GCGCATCGGGGCCGT GCAGTACACC TACGAACAGCGGC 

f;J TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GTCCAAGCCCAACAAGAGGAAGTTAATGATCCTCATCACCGACGGGAGGTCCTACGACGACGTCC 
GGAT CCCAGCCATGGCTGCCCATC T GAAGGGAGTGATCACC TATGCGATAGGCGT T GCCT GGGC T 
GCCCAAGAGGAGCTAGAAGTCATTGCCACTCACCCCGCCAGAGACCACTCCTTCTTTGTGGACGA 
GT TTGACAACCTCCATCAGTATGTCCCCAGGATCATCCAGAACAT TTGTACAGAGTTCAACTCAC 
AGCC TCG GAAC TGAAT TCAGAGCAGGCAGAGCACCAGCAAGTGC TGC TT T ACTAAC TGACG TGT T 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAAT GT C T TGT TAT TAT T CT TT GCCATCAT GCT TT T TCATAT T CCAAAACT T GGAGT TACAAAGA 
TGAT CACAAACGTATAGAAT GAGCCAAAAGGCTACAT CAT GT T GAGGGT GC T GGAGAT T T TACAT 
T T TGACAAT T GTT T T CAAAATAAATGTT CGGAATACAGT GCAGCCC T TACGACAGGC T TACGTAG 
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
T T T TGT CATGACAAT GTAGGAAT TGC TGAAT TAAAT GT T TAGAAGGATGAAAAATAAAAAAAA&A 
AAAAAAAAAAAAAAAZW^AAAAAAAJ^ 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
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FIGURE 34 

MRTWLTMKASVIEMFLVLLVTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG 
CQDPKYHVYGTDVYAS YS SVCGAAVHSGVLDNSGGK I LVRKVAGQSGYKGS YSNGVQS LSLPRWR 
ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA 
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP 
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI 
GPAGPLMGWQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN 
RS GAPNWWMVDGWPTDKVEEASRLARES G IN I FF I T I EGAAENEKQYWEPNFANKAVCRTNG 
FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN 
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL 
FKKSKPNKRKLMILITDGRSYDDVRI PAMAAHLKGVITYAIGVAWAAQEELEVIATHPARDHSFF 
VDE FDNLHQ YVPR 1 1 QN I CTE FNS QPRN 

Important features : 
Signal peptide: 

amino acids 1-26 

Transmembrane domain: 

amino acids 181-2 00 

N-glycosylation sites. 

amino acids 390-394, 520-524 

N-myristoylation sites. 

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395, 
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633 

Amidation site. 

amino acids 304-308 
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FIGURE 35 



CCGAGCACAGGAGATTGCCTGCGTTTAGGAGGTGGCTGCGTTGTGGGAAAAGCTATCAAGGAAGAAATTGC 
CAAACC AT GTCTTTTTTTCT GTT TTC AG AGT AGT T C AC AAC AGATCTG AGTGT TTT AATT AAGCATGG AAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CT CAGC CT TCCCCACT AC AAT GTG AT AGAACGC GT GAACT GG AT GT AC TT CT ATGAGT AT GAGCCGATTT A 
CAGACAAGACTT TC ACTT C AC ACT TCGAG AGC AT T C AAAC T GCT CTC ATC AAAATCCATTTCTGGTCATT C 
TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT 

U 

L... TGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAGAGGCTGAAAAGGAAGACAAAATGTTGGC 

P 

Ztt AT TGTCCT T AGAGGATGAACAC CT TCT TT ATG GT GAC ATAATCC G ACAAGAT TTTTT AGACACAT ATAAT A 

w 

ft« ACCTGACCTTGAAAACCATTATGGCATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATG 
I rr j AAGAC AGACACT GATGTT TT CATC AAT ACT GGC AAT T T AGTGAAGT AT CTTTT AAACCT AAACC ACT CAGA 

%j GAAGT T TT TCAC AGGTT ATCCTCT AATTGAT AATT ATT CC TAT AGAGG AT TT T ACC AAAAAACCCAT ATT T 

^ CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT 

TTGGTGCC AAGGAT CT ATGAAATGATGGGT CACGTAAAACCC AT CAAGTT TG AAGATGT TT ATGT CGGGAT 
* CTGTTT GAATTT ATT AAAAGTG AAC ATTCATAT TC C AGAAGACACAAATCTTTTCT T TCT AT AT AG AAT CC 

JV-J ATTT GGAT GT CT GT CAAC TGAGAC GT GTGATT GC AGCCCAT GGCT T TT CT T CCAAGGAGATCAT CACT TT T 

T G GC AGGT CAT GCT AAGGAACACC AC ATGCCAT T AT TAAC T T CACATT CT ACAAAAAGCC TAGAAGGAC AG 

UJ - 

^ GATACCTTGTGGAAAGTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT 
f?* AC AC TG AACT GAAACT CAT GAAAAACCC AG ACT GGAGACT GGAG GGT T AC ACTT GT GAT T T ATT AGTC AGG 

MS CCCTTCAAAGATGATATGTGGAGGAATTAAATATAAAGGAATTGGAGGTTTTTGCTAAAGAAATTAATAGG 
ACCAAAC AATTT GGACATGT CATT CT GT AG ACT AG AAT TT CTT AAAAGGGT GTT ACTGAGTT AT AAGCT C A 
CT AGGCTGT AAAAAC AAAAC AATGTAGAGT TTT AT T T ATTGAAC AAT GT AGTCACT T GAAGGT T TT GT GT A 
TAT CTT AT GT GGAT T ACC AAT TT AAAAAT AT ATGT AGT TCTGTGT CAAAAAACT TCTT C ACT GAAGTT AT A 
CTGAACAAAATTTTACCTGTTTTTGGTCATTTATAAAGTACTTCAAGATGTTGCAGTATTTCACAGTTATT 
AT T ATT T AAAAT T ACT TCAACT T T GTGTT T TT AAAT GT T TTGACGATT TC AAT ACAAGAT AAAAAGGAT AG 
TGAATC AT TCTT TACAT G CAAAC ATT TT CC AGT T AC TT AACT GAT C AGTTT AT T AT TGATACAT C ACT C CA 
TT AATGT AAAGT CAT AGGTCATT ATT GC AT AT C AGT AATCT C TT GGAC TT TGTT AAAT AT TT T ACT GT GGT 
AATATAGAGAAGAATTAAAGCAAGAAAATCTGAAAA 
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FIGURE 26 

MASALWTVLPSRMSLRSLKWSLLLLSLL^ 

TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEi\EKEDKMLA 
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWTEFCPNAKYVMKTDTDVFINTGNLVKYLL 
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV 
KPIKFEDVYVGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR 
NTTCHY 

Important features: 

Type II transmembrane domain: 

amino acids 20-39 

N-glycosylation sites. 

amino acids 72-76, 154-158, 198-202, 212-216, 326-330 

Glycosaminoglycan attachment site. 

amino acids 239-243 

Ly-6 / u-PAR domain proteins . 

amino acids 23-37 

N-myristoylation site. 

amino acids 271-277 
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FIGURE 37 



C GC TCGGGCAC CAGCC GCGGC AAGGATGGAGC TGGGTTGC T GGACGC AGTT GGGGC T CAC T TT TC TTC AGC TCCT T C TCATC 
T CGTCCT TGCCAAGAGAGTACAC AG TCATTAATGAAGCC T GCCC TGGAGCAGAGTGGAATATCATGTGTCGGGAGTGC TGTG 
AATATGATCAGAT TGAGTGCGTC TGCCCCGGAAAGAGGGAAGTCGTGGGTT ATAC CATCC C T TGC TGC AGGAATGAGGAGAA 
T GAGTGTGACTCC TGC CTGAT CCACCCAGGT TGTACC ATC T T TGAAAAC TGC AAGAGC TGC CGAAATGGCTCATGGGGGGGT 
ACC TT GGATGAC TTC TATGT GAAGGGGT T C TAG TGTGCAGAGTGCCGAGCAGGCTGGTAC GGAGGAGAC TGCATGCGATGTG 
GCCAGGT TC TGCGAGCCCC AAAGGGTCAGATT T TGTTGGAAAGC TATCCCC T AAAT GC TCACT GT GAATGGACCAT T CATGC 
TAAACCTGGGT T TGTCATCC AAC TAAGATTTGTCATGTTGAGTCTGGAGTT TGAC TACATGTGCCAGTATGACTATGTTGAG 
GT TCGTGATGGAGACAACCGC GATGGCCAGAT CAT CAAGC GTGTC TGTGGC AACGAGCGGC CAGC TCC TAT CCAGAGCATAG 
GATCC TCACTCCACGTCCTCTTCCACTCCGATGGC TC CAAGAAT TTT GACGGT TT C C ATGCC ATT TATGAGGAGAT CACAGC 
ATGC TCCTCATC CCC T TGT T TCCATGACGGCACGT GCGTCC T TGACAAGGC TGGATCTTACAAGTGTGCCT GCTTGGCAGGC 
TATAC TGGGCAGCGCTGTGAAAATC TCC TTGAAGAAAGAAACTGC TCAGAC CC TG GGGGCCCAGTCAAT GGGTAC CAGAAAA 
TAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGCTAAAATTGG<:ACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGT 
TC TTAGTGGCAATGAGAAAAGAAC T TGCCAGCAGAAT GGAGAGTGGTC AGGGAAAC AGCC CATCTGC ATAAAAGCC T GCCGA 
GAAC CAAAGAT T TCAGACC T GGTGAGAAGGAGAGT TC TTCC GATGCAGGTT CAGT CAAGGGAGACACCATT ACACCAGC TAT 
AC TCAGCGGCC T TCAGC AAGC AGAAAC T GCAGAGTGC CCC TACC AAGAAGCCAGCCC TTC C C T TTGGAGATC TGCCCATGGG 
ATACCAACATCTGCATACCCAGCTCCAGTATGAGTGCATCTCACCCTTCTACCGCCGCCTGGGCAGCAGCAGGAGGACATGT 
C TGAGGACT GGGAAGTGGAGT GGGCGGGCAC C AT CCTGC ATCCC TATCTGCGGGAAAATTGAGAACAT CACTGCTC CAAAGA 
CCCAAGGGT TGCGCTGGCCGTGGCAGGC AGC CATC TACAGGAGGAC C AGCGGGGT GC ATGACGGCAGC CTAC ACAAGGGAGC 
GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG 
GTCACCATGAT CAAGACAGC AGACC TGAAAGT TGTT T TGGGGAAATT C TAC CGGGATGATGACCGGGAT GAGAAGACCATCC 
AGAGCCTACAGATT TC TGC TATCAT TCTGCATCCC AACTATGACCCC AT CC TGCTT GATGC TGAC AT C GC CATCC TGAAGC T 
CC TAGACAAGGC CCGTATCAGCACCCGAGTC C AGCCCAT CTGCCTCGCTGC CAGTC GGGAT CTCAGCACT TCCTTCCAGGAG 
TCCCACATCACTGTGGC TGGC TGGAATGTCCT GGC AGAC GTGAGGAGC C CT GGC TTCAAGAACGACAC AC TGCGC TC TGGGG 
TGGT C AGTGTGGTGGAC TC GC TGC T GTGTGAGGAGCAGC ATGAGGAC C ATGGCATC CCAG TGAGTGTCAC TGATAACAT GT T 
C TGT GCCAGCTGGGAAC CC AC TGC C CCT TC TGATATC TGC AC TGC AGAGACAGGAGGCATCGC GGCTGTGTCCTT CCCGGGA 
CGAGCATC TCC T GAGC C ACGC TGGCATC TGAT GGGAC TGGTCAGC TGGAGC TAT GATAAAACATGCAGCCACAGGCT CTCCA 
CTGC C TTCACC AAGGTGCTGCCT T T TAAAGAC TGGAT TGAAAGAAATATGAAATGAACC AT GC TCATGC ACT CCT TGAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTGCCAGGGC T TC TGAC TTC AGGGACAAAAC TCAGT GAAGGGTGAGT AGAC C T CC ATTGC T GGTAGGC TGATGCC GC GTCC A 
C TAC TAGGACAGCCAAT TGGAAGATGCC AGGGC TT GCAAGAAGTAAG TTTC T T CAAAGAAGAC CATATACAAAACCTCTCCA 
C TCCACTGACC T GGTGGTC TTCCCC AAC T TTCAGT TATAC GAATGCC ATCAGC TTGACC AGGGAAGAT CTGGGCT TCATGAG 
GCCCCTT TTGAGGCTC TCAAG TTC TAGAGAGC TGC C T GTGGGACAGC CCAGGGCAGCAGAGCT GGGAT GTGGTGC ATGC C T T 
T GTGTACATGGC CAC AG TAC AGTC TGG TCCT T TTC C T TCC CCATC TC TTGT ACAC AT T T TAAT AAAATAAGGGTTGGCT T C T 
GAAC TAC AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 38 

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREWGYT 
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ 
VLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQII 
KRVCGNERPAP I QS I GS SLHVL FHSDGSKNFDGFHAI YEE I TAC S S S PCFHDGTCVLDKAGS YKC 
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTWSFFCNNSYVLSGNE 
KRTCQQNGE WS GKQP I C IKACRE PKI SDLVRRRVLPMQVQSRE T PLHQLYSAAFSKQKLQS APTK 
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP 
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTVWAAHCVTDLGPCVTMIKTADL 
KWLGKFYRDDDRDEKT IQSLQI SAI I LHPNYDP I LLDADI AI LKLLDKARI S TRVQP I CLAASR 
DLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSGWSWDSLLCEEQHEDHGIPVSVTDNMFCA 
SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein: 
Signal peptide: 

amino acids 1-23 

EGF-like domain cysteine pattern signature, 
amino acids 260-272 
N-glycosylation sites . 

amino acids 96-100, 279-283, 316-320, 451-455, 614-618 
N-myristoylation sites. 

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314, 

474-480, 491-497, 638-644, 666-672 

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
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FIGURE 3Q 



GGTTC CT ACAT C CT CT CAT CT GAGAAT CAGAGAGCATAAT CTT CTTACGGGCC CGT GAT TTATTAACGT GGCTTAAT C 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGCCCTTGTAGCTGACAGAAGGTGGCCAGGGAGAATGCAGCACACTGCTCGGAGAATGAAGGCGCTTCTGTTGC 
TGGTCTTGCCTTGGCTCAGTCCTGCTAACTACATTGACAATGTGGGCAACCTGCACTTCCTGTATTCAGAACTCTGTA 
AAGGT GCCTC CCACT ACGG C CT GACCAAAGAT AGGAAGAGGC GCT CACAAGATGGCT GT C CAGACGGCT GTGCGAGC C 
T CACAGCCAC GGCT CCCT CCCCAGAGGTTT CT GCAGCT GCCAC CATCT CCTTAAT GACAGACGAGCCT GGCCT AGACA 
ACC CT GCCTACGT GT CCT CGGCAGAGGAC G GGCAGCCAGCAAT CAGCCCAGT GGACT CT GGCC GGAGCAACC GAACTA 
GGGCACGG CCCTTT GAGAGAT C CACT ATT AGAAGCAGAT CATTTAAAAAAATAAATCGAGCTT T GAGTGTTCTT CGAA 
GGACAAAGAGCGGGAGT GCAGT T GCCAAC CAT GC CGAC CAGGG CAGGGAAAATT CTGAAAACACCACTGC CCCT GAAG 
TCTT T CCAAG GT T GTACCAC CT GATTCCAGAT GGT GAAAT TAC CAGCAT CAAGAT CAATCGAGTAGAT CC CAGT GAAA 
GCCT CTCTATTAGG CT GGT GGGAG GT AGCGAAAC C CCACT GGT CCATAT CAT TAT CCAACACATTTAT CGTGAT GGGG 
TGATCGCCAGAGACGGCCGGCTACTGCCAGGAGACATCATTCTAAAGGTCAACGGGATGGACATCAGCAATGTCCCTC 
ACAACTACGCTGTGCGTCTCCTGCGGCAGCCCTGCCAGGTGCTGTGGCTGACTGTGATGCGTGAACAGAAGTTCCGCA 
GCAGGAACAATG GACAGGCCCC GGAT GCCTACAGACC C C GAGATGACAGCTTT CATGT GATT CT CAACAAAAGTAGCC 
CCGAGGAG CAGCT T GGAATAAAACT G GT GCGCAAGGT GGATGAGCCT G GGGTTT T CAT CTTCAAT GT GCT GGAT GGC G 
GT GT GGCATAT CGACAT GGTCAGCTTGAG GAGAAT GACCGTGT GT TAGCCATCAAT GGACATGAT CTT CGATAT GGCA 
GCCCAGAAAGTGCGGCTCATCTGATTCAGGCCAGTGAAAGACGTGTTCACCTCGTCGTGTCCCGCCAGGTTCGGCAGC 
GGAGCCCTGACAT CTTTCAGGAAGCCGGCT GGAACAGCAATGGCAGCT GGT CCCCAGGGC CAGGGGAGAGGAGCAACA 
CT C CCAAGCC CCT CCATCCTACAATTACT TGT CAT GAGAAGGT GGTAAAT AT CCAAAAAGACCCCGGT GAAT CTCTCG 
GCATGACC GT CGCAGGGGGAGCAT CACATAGAGAAT GGGATTT GCCTAT CTATGT CAT CAGT GT T GAGCC CGGAGGAG 
T CATAAGCAGAGAT GGAAGAATAAAAACAGGT GACATTT T GT T GAATGT GGAT G GGGT CGAACT GACAGAGGT CAGC C 
GGAGTGAGGCAGTGGCATTATTGAAAAGAACATCATCCTCGATAGTACTCAAAGCTTTGGAAGTCAAAGAGTATGAGC 
C CCAGGAAGACT GCAGCAGCC CAGCAGCCC T GGACT C CAAC CACAACATGGCCCCACC CAGT GACT GGTCCC CAT CCT 
GGGT CAT GT GGCT GGAATTACCACG GT GCT T GTAT AACT GTAAAGAT AT T GTATTACGAAGAAACACAGCT GGAAGT C 
T GG GCTT CTG CATTGTAGGAGGTT AT GAAGAATACAATGGAAACAAAC CTTT TTTCATCAAAT CCATT GTTGAAG GAA 
CAC CAGCAT ACAAT GAT GGAAGAATTAGAT GT GGT GAT AT T CTTCTT GCTGT CAAT GGTAGAAGTACATCAGGAAT GA 
TACAT GCTTGCTT GGCAAGACTGCT GAAAGAACTT AAAGGAAGAATT ACT CTAACTATTGT TT CTT GGCCT GGCACTT 
TTTT ATAGA AT CAAT GAT GGGT CAGAGGAAAACAGAAAAATCACAAAT AGGCTAAGAAGTT GAAACACTATATTT ATC 
TT GTCAGTTT TT ATAT TTAAAGAAAGAATACATTGTAAAAAT GT CAGGAAAAGTAT GATCAT CTAAT GAAAGCCAGTT 
ACACCT CAGAAAATAT GATT CCAAAAAAATTAAAACT ACT AGT TT TT TTT CAGT GT GGAGGAT TT CT CAT TACTCTAC 
AAC ATT GTTT ATATTTTT T CT AT TCAATAAAAAGCC CTAAAACAACTAAAAT GATT GATTT GTATACCCCACT GAATT 
CAAGCT GATTTAAATTTAAAATT TGGTATAT GCTGAAGT CT GC CAAGGGTACATTATGGCCAT TTTT AAT T TACAGCT 
AAAATATTTTTTAAAAT GCATT GCT GAGAAAC GTT GCTTT CAT CAAACAAGAATAAAT ATTTT TCAGAAGTTAAA 



FIGURE 40 

MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAAT I S LMTDEPGLDNPAYVS SAEDGQPAI S PVDSGRSNRTRARPFERS TIRSRS FKKINR 
AL S VLRRT KS GS AVANHADQGRENS ENT T APE VFPRL YHL IPDGEITSIKI NRVDP S E S L S I RL V 
GGSETPLVHII IQHIYRDGVIARDGRLLPGDI ILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM 
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG 
QLEENDRVLAINGHDLRYGS PESAAHLI QASERRVHLWSRQVRQRS PD I FQEAGWNSNGS WS PG 
PGERSNTPKPLHPTITCHEKWNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR 
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388 

Tyrosine kinase phosphorylation sites. 

amino acids 433-441, 492-500 

N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
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FIGURE Al 



AC C AGGC AT TGT ATC T T CAGT T GTCATC AAGT T C GCAAT CAGAT T GGAAAAG C T C AACT T GAAGC TT T 
C T T GCCT GCAGT GAAGCAGAGAGATAGATATTATT CACGTAATAAAAAACATGGGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
C AC CAGT AACTAC TT C GT G GGT G CCAT T C AAGAGAT T CC T AAAGCAAAG GAGT T CAT GGCT AAT T TCC 
ATAAGACCCTCATTTTGGGGAAGGGAAAAACTCTGACTAATGAAGCATCCACGAAGAAGGTAGAACTT 
GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAA.CCAGATCTCAC 
T TTGGAAGAGGTACAGGCAGAAAAT CCCAAAGTGT CCAGAGGCCGGT AT CGCCCT CAGGAATGTAAAG 
CTT TACAGAGGGT CGCCAT CCTCGT TCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCT GGAA 
^ CAT C TGCAT CCCT TCCTGCAGAGGCAGCAGCTGGAT TATGGCATCTAC GTCAT CCACCAGGCTGAAGG 

Q TAAAAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGG 
O ACT GCTTTATAT TCCACGATGTGGACCTGGTACCC GAGAATGACT TTAACCT TTACAAGTGTGAGGAG 

? f I CAT C CCAAG CAT CT GGT GG T T GG CAGGAACAGCAC T G GGT ACAGG T T AC GT T ACAGT GGAT AT T T TGG 

/ l ! GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT 
s t " G GG GAGGCGAAGACGATGAC C T CAGAC TC AGG G TT GAGC T C CAAAGAAT GAAAAT TTCCCGGCCC CT G 

■e* C CT GAAGT GGGT AAATATACAAT GGT C T T CC AC AC T AGAGACAAAGG CAAT GAGGT G AAC G CAGAAC G 

s GATGAAGCTCTTACACCAAGTGT CACGAGTCTGGAGAACAGAT GGGT TGAGTAGT TGTTCT TAT AAAT 

C! TAGTATCTGTGGAACACAATCCTTTATATATCAACATCACAGTGGATTTCTGGTTTGGTGCATGACCC 
•4* TGGATCTTTTGGTGATGTTTGGAAGAACTGATTCTTTGTTTGCAATAATTTTGGCCTAGAGACTTCAA 
ATAGTAGCACACATTAAGAACCTGTTACAGCTCATTGTTGAGCTGAATTTTTCCTTTTTGTATTTTCT 
ill TAG C AGAG CTCCTGGT GAT GT AGAG T ATAAAAC AGT T GT AACAAG ACAG CT T T CT TAGTCATTTTGAT 

y 

»; CATGAGGGT TAAA.TATTGTAATATGGATACTT GAAGGACT TTATATAAAAGGATGACTCAAAGGATAA 

S : Vi? 

AATGAACGCTAT T T GAGGAC T CT GGT T GAAGGAGAT T TAT T TAAAT T T GAAG T AAT ATAT T ATGGGAT 
AAAAGGCCACAGGAAATAAGACTGCTGAATGT CTGAGAGAACCAGAGT T GTT CTCGTCCAAGGTAGAA 
AGG TACGAAGAT ACAAT AC T G TTAT TCAT T TAT C C T GTACAAT CAT C T GT GAAGT GGT G GT GT CAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGCGACGAATCAGGACACAGT GAACTT GGGAAT GAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
T TTTAAAGAGT TTTTGTAAAATGATTTTGTACAAGT AGGATAT GAAT TAGCAGTT T ACAAGTTT ACAT 
ATTAACTAATAATAAATAT GT CTATCAAATACCTCT GTAGTAAAATGTGAAAAAGCAAAA. 
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FIGURE 42 

MGFNLT FHLS YKFRLLLLLTLCLTWGWATSNYFVGAIQE I PKAKE FMANFHKTL I LGKGKTLTN 
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH 
RNREKHmYLLEHLHPFLQRQQLDYGIWIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV 
DLVPENDFNLYKCEEHPKHLWGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED 
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEWAERMKLLHQVSRVWRTDGLSSCSYKLV 
S VEHNP L Y IN I TVD FW FGA 

Important features : 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 4-8, 220-224, 335-339 

Xylose isomerase proteins. 

amino acids 191-202 
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FIGURE 43 



GCTCAAGAC CCAGCAGT GGGACAGCCAGACAGAC GGCACGATGGCAC TGAGC TCCCAGAT C T GGG 

CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 

ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT ' 

GTTCCAGAGGCGAAGGAGGCGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 

ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 

TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 

TCCAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



Iwsr 
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FIGURE ±± 

MM.SSQIWAACLLLLLLIASLTSGSVFPQQTGQI^LQPQDRAGARAS^PMFQRRRRRDTHFPI 
CIFCCGCCHRSKCGMCCKT 



Important features: 
Signal peptide: 

amino acids 1-24 



m 
o 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 



'* ' amino acids 58-59 



N-myristoylation site. 

amino acids 44-50 



Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12 
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FIGURE 45 



GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC 
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCGCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACC TTCAACACAACCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTAT CATAGTGACCCAAA 
AT CGTAATAGGGAGAGAGTAGACT TCCCAGATGGAGGC T ACT CCCT GAAGCTCAGCAAACTGAAG 
AAGAAT GACT CAGGGAT CTAC TATG TGGGGATATACAGC T CATCAC TCCAGCAGCCCTCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAAT GGCACC TGT GT GACCAATCT GACATGCT GCAT GGAACAT GGGGAAGAGGAT GT GAT T 
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
P CTGGAGATGGGGAGAAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACT 
5f TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
ri GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
\f GTT TCTGAAGAGAGAGAGACAAGAAGAGTACATTGAAGAGAAGAAGAGAGT GGACAT TTGTCGGG 

H AAACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAAT 
4 ;? AGAACAATCC TAAAGGAAGATC CAGCAAATACGGT T T AC TCCAC TGTGGAAATACCGAAAAAGAT 

1„ GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 
pj TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA 

6 . 
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FIGURE 46 

MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP 
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK 
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC 
VARNPVSRNFSSPILARKLCEGAM)DPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE 
EKKRVD I CRE TPNICPHS GENTEYDT I PHTNRT I LKEDPANT VYS TVE I PKKMENPHS LL TMPDT 
PRLFAYENVI 

Important features: 
Signal peptide : 

amino acids 1-22 

Transmembrane domain: 

amino acids 224-250 

Leucine zipper pattern. 

amino acids 229-251 

N-glycosylation sites. 

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208, 
291-295 
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FIGURE 47 

GGCTCGAGCGTTTCTGAGCCAGGGGTGACCATGACCTGCTGCGAAGGATGGACATCCTGCAATGG 
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT 
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT 
ATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGTG 
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC 
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC 
AACAGTAATGCCAATTGTGAATTTTCATTGAAAAACATCAGTGACATTCATCCAGAATCCTTCAA 
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG 
ACACCAT GGCGAGTGGCT GGAGAGCATCTAGT T TCCACT T CGAT TC TGAAGAAAACAAACATAGG 
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT 
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG 
TG TAG TTTAATGGGAATAAAATGTAAGTATCAGTAGTTTGAAAAAAAAAAA 
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FIGURE a8 

MTCCEGWTSCNGFSLLVLLLLGWLNAI PLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA 
TTMSLTARKRACCNNRTGMFLS S FFSVI TVI GALYCMLI S IQALLKGPLMCNS PSNSNANCE FSL 
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL 
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV 

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II}, 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 207-218 

TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12 
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FIGURE J.Q 

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGCCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCCCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGT CC TGTACCT GAGAAGGC CAT CCCAC TCATCAC TCCAGGC TCT GCCA 
CTACTTGC TGAG CACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT 
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA 



APP_ID=10063714 



Page 203 



FIGURE so 



MERVTLALLLLAGLTALE ANDPFANKDDP FYYDWKNLQLSGL I CGGLLAI AG I AAVL S GKCKYKS 
SQKQHS PVPEKAI PL I TPGSATTC 

Important features: 
Signal peptide: 

amino acids 1-16 

Transmembrane domain: 

amino acids 36-59 



Iff N-myristoylation sites. 

lW ■ amino acids 41-47, 45-51, 84-90 

2 , 

Extracellular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 . 



amino acids 54-67 
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FIGURE si 

GTGGACTCTGAGAAGCCCAGGCAGTTGAGGACAGGAGAGAGAAGGCTGCAGACCCAGAGGGAGGG 

AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG 

AGGAGACCCTGGTGGGAGGAAGACACTCTGGAGAGAGAGGGGGCTGGGCAGAGATGAAGTTCCAG 

GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 

CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 

GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 

GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 

AGCAGAT GC T T TGGGCAACAGGGT C GGGGAAGCAGCCCATGC T CT GGGAAACAC TGGGCAC GAGA 

T TGGCAGACAGGCAGAAGAT G T CAT TCGACACGGAGCAGATGC TGTCCGCGGC TCCTGGCAGGGG 

GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 

CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG 

GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA 

GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG 

AGCCAGCAAC CAGAATGAA.GGGT GCACGAAT CCCCCACCATCTGGC TCAGGT GGAGGCTCCAGCA 

AC T C TGG G GGAGGC AGC GG C T CACAGT C G GG C AGC AGT GG C AG T GG C AG C AAT GG T GACAAC AAC 

AATGGCAGCAGCAGTGG T GGCAGCAGCAGTGGCAGCAGCAGT GG CAGC AG C AG TGGC GGCAGCAG 

TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT 

CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 

CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 

C TT CAGAGGACAGGGAG T T T CCAGCAACATGAGGGAAAT AAGCAAAGAGGG CAATCGCCTCCT TG 

GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 

GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 

GAAT T T T AAAT C CAAGCTGGGT T TC AT CAAC TGGGAT GCCAT AAACAAGGACC AGAGAAGC T C TC 

GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA 

AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 



MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS 
EALGQGT REAVGT GVRQVPGFGAADALGNRVGEAAHALGNTGHE I GRQAEDVI RHGADAVRGS WQGVP 
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF 
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS 
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE 
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM 
FNFDT FWKNFKSKLGFI NWDAINKDQRS SRI P 

Signal peptide: 



amino acids 1-21 



||1 N-glycosylation site. 

i j » i 

^ amino acids 265-269 



Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 



Casein kinase II phosphorylation site. 

J amino acids 26-30, 109-113, 259-263, 300-304, 304-308 



N-myristoylation site. 

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304 
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FIGURE 



a 

o 



GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGGATGTCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT 
h k GAAGTCCTATATAACGATCTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG 
CCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAGCACATCAGCCTCATGACCTTGGACAGTCTA 
CAGAAATGCATCTTCAGCTTTGACAGCCATTGTCAGGAGAGGCCCAGTGAATATATTGCCACCAT 
CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT 
%l ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
AGCCAAGT CCAAGAC TT TGGAT T TCAT T GATG T GC T T CT GC T GAGCAAGGATGAAGATGGGAAGG 
L CAT TGT CAGATGAGGATAT AAGAGCAGAGGCT GACACCT T CAT GT TT GGAGGCCATGACACCACG 

pi GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
Q ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC 
AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC 
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA 
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA 



•ssasr 

m 



APP 



ID=10063714 



Page 207 of 322 



FIGURE 54 

MSLLSLPWLGLRPVAMSPWLLLLLWGSWLLARILAWTYAFYNNCRRLQCFPQPPKRNWFWGHLG 
LI TPTEEGLKDSTQMSATYSQGFTVWLGPI I PFI VLCHPDT I RS I TNASAAI APKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWVXYNIARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP 
lAFIPFSAGPRNCIGQAFAMAEMKWLAmLLHFRFLPDHTEPRRKLELIMRAEGGLWLRVEPLN 

VGLQ 

Important features: 
Transmembrane domains: 

amino acids 13-32 (type II), 77-102 

Cytochrome P450 cysteine heme-iron ligand signature. 

amino acids 461-471 

N-glycosylation sites. 

amino acids 112-116, 168-172 
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FIGURE 55 

AT CGCAT CAAT TGGGAGTACCATC T T CCT CATGGGACCAGT GAAACAGCTG AAGCGAAT GT T T GA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAAT T CAT GGCCAGT TT TATGAAGC TTTGGAAGGCACT AT GGACAGAAGC T GGTGGACAGT T T T 
GTAACTATCTTGGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGATTCGAACATTTGAGGGTTACTTTTGGAAGCAACAATACATTCTCGAACCTGAATGTCAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC 
T C TGGATGT TGTCCCACTGAAT T CCCATGAATACAAAC CTAT TCAGCAACAGCAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 56 

MGPVKQLKRMFE PTRL I AT IMVLLC FALTLCSAFWWHNKGLAL I FC ILQS LALTWYSLSFIPFAR 
DAVKKC FAVCLA 

Important features: 
Signal peptide: 

amino acids 1-33 

Type II fibronectin collagen -binding domain protein. 

amino acids 30-72 
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FIGURE 57 



CGGCTCGAGCTCGAGCCGAATCGGCTCGAGG6GCAGTGGAGCACCCAGCAGGCCGCCAAC ATGC TCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTTT CAAGCT CAGT GTCTT CATCCCCT C CCAGGAAT T CT C CACCTACCGC CAGTGGAAGCAGAAAATT GTACAAGC T 
GGAGAT AAGGACCT TGAT GGGCAGCTAGACT TT GAAGAATTT GT CCATTAT CT CCAAGATCATGAGAAGAAGCT GAGG 
CTGGT GTTTAAGATTTT GGACAAAAAGAAT GAT GGAC GC ATT GACGCGCAGGAGAT CAT GCAGTCCCT GCGGGACTTG 
GGAGTCAAGATAT CTGAACAGCAGGCAGAAAAAATT CT C AAGAGCAT GGATAAAAACGGCACGATGACCAT CGACTG G 
AACGAGTGGAGAGACTACCACCT C CTCCACC CCGT GGAAAACAT CCCC GAGAT CAT CCT CT ACT GGAAGCATT CCAC G 
ATCTTTGATGTGGGTGAGAATCTAACGGTCCCGGATGAGTTCACAGTGGAGGAGAGGCAGACGGGGATGTGGTGGAGA 
(^CCTGGTGGC^LGGAGGTGGGGCAGGGGCCGTATCCAGAACCTGCACGGCCCCCCTGGACAGGCTCAAGGTGCTCATG 
CAGGTCCATGCCTCCCGCAGCAACAACATGGGCATCGTTGGTGGCTTCACTCAGATGATTCGAGAAGGAGGGGCCAGG 
TCACTCT GGCGGGGCAAT GGCATCAAC GT CCT CAAAATT GCCCC CGAAT CAGC CAT CAAAT TCAT GG C CTAT GAGCAG 
AT CAAGCGCCT T GTTGGTAGT GACCAGGAGACT CTGAGGATT CACGAGAGGCTTGTGGCAGGGT CCTT GGCAGGGGC C 
ATCGCCC^GAGCAGCATCTACCCAATGGAGGTCCTGAAGACCCGGATGGCGCTGCGGAAGACAGGCCAGTACTCAGGA 
JZ ATGCTGGACTGCGCCAGGAGGATCCTGGCCAGAGAGGGGGTGGCCGCCTTCTACAAAGGCTATGTCCCCAACATGCTG 
W GGCATCAT CCCCTATGCC GGCAT C GACCTT GCAGTCTAC GAGACGCT CAAGAAT GCCT GGCT GCAGCACTAT GCAGT G 

Pi AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
C CCCT GGCCCTAGT CAGGACC C GGAT GCAGG C GCAAGCCTCTAT T GAGGGCGCTCCGGAGGTGACCAT GAGCAGCCTC 
*P* TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
h| GCT GTGAGCAT CAGCTACGT GGT CTACGAGAACCTGAAGAT CACCCT GGGC GT GCAGT C GCGGTGACGGGGGGAGGGC 

jT >! j CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
t y AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
14 GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 
"p CTC CGGGCT CACAT GT GTAAGGACAGGACATT TT CT GCAGT G CCTG C CAATAGT GAGCT TGGAGCCT GGAGGCC GGCT 

*fe TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
j= CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
f-j ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGC 
W TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG 
If! CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT 
ft'i GGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTGTCCCCACTGTGGCATGAGGGCAGTGGAGCA 
C CATGTT T GAGGGC GAAGGGCAGAGC GTTTGT GT GTT CT GGGGAGGGAAGGAAAAGGT GTT GGAGGC CTTAATT AT GG 
|[| ACTGTT GGGAAAAGGGTTTTGT CCAGAAG GACAAGCCGGACAAATGAGCGACTT CT GT GCTT CCAGAGGAAGACGAGG 

f?fi. GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC 
GGGACCAGCCCCACATT C CACTT GT GTCACT GCT T G GAACCTATTTATT TT GTATT TAT TT GAACAGAGTTATGTC CT 
fll AACTATT TTTATAGATTTGTTTAATTAATAGCT T GT CATTTT CAAGT TCAT TTTTTATT CAT ATTTAT GTT CAT G GT T 

GATT GTAC CTTC CCAAG C CCGC CCAGT GG GAT GGGAGGAGGAGGAGAAGGGGGG CCTT GGGCCGCTG CAGT CACAT CT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GTTGGGGAAGGGCTTGCCCCCAGCCTTAGGATTTCAGGGTTTGACTGGGGGCGTGGAGAGAGAGGGAGGAACCTCAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
TGAGGAAAT CTT CAATAGGAT GCAAAGAT CAATGCAAAAAT T GT TATATATGAACATATAACT GGAGTCGT CAAAAAG 
CAAATTAAGAAAGAATT GGAC GTTAGAAGTT GTCATT TAAAGCAGCCTT CTAATAAAGT TGT TT CAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 58 



MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEE FVHYLQDHEKKLRLVFKI LDKKNDGRI DAQE IMQS LRDLGVKI SEQQAEKI LKSMDKNG 
TMTIDWNEWRDYHLLHPVENIPEIILYWKHSTIFDVGENLTVPDEFTVEERQTGMWWRHLYAGGG 
AGAVSRTCTAPLDRLKVMQVHASRSNmGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK 
FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR 
ILAREGVAAFYKGYVPNMLGI I PYAGI DLAVYETLKNAWLQHYAVNSADPGVFVLLACGTMSSTC 
GQLASYPLALVRTRMQAQASIEGAPEVTMSSLFKHILRTEGAFGLYRGIAPNFMKVIPA^ 
VYENLKITLGVQSR 

Important features: 
Signal peptide: 

amino acids 1-16 



Putative transmembrane domains: 

Q amino acids 284-304, 339-360, 376-394 



111 



P 



Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140 
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FIGURE 

GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCC ATGG C 
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG 
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACAT T GGGGAGGATGGAATCCT GAGCT GCAC TT T T GAACC T GACATCAAAC TT TCT GAT AT 
CGTGATACAATGGC TGAAGGAAGGTGTT T TAGGCT TGGT CCATGAGTT CAAAGAAGGCAAAGATG 
AGC TGT CGGAGCAGGAT GAAATGT TCAGAGGCCGGACAGCAGTGT T TGC TGATCAAGTGAT AGTT 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
CATCACTTCTAAAGGCAAGGGGAATGCTAACCTTGAGTATAAAACTGGAGCCTTCAGCATGCCGG 
AAGTGAATGTGGACTATAATGCCAGCTGAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CAGCCCACAGTGGTC T GGGCATCCCAAGTT GACCAGGGAGCCAAC T TC T CGGAAGT CT CCAAT AC 
CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
T CAACAACACATACTCC TG TATGATT GAAAATGACAT T GCCAAAGCAACAGGGGATATCAAAGTG 
ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGCTGCTAAACTCAAAGGCTTCTCTGTGTGT 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAA ATAA T 
GTGCCTTGGCCACAAAAAAGCATGCAAAGTCAT TGTTACAACAGGGATCTACAGAACTAT TT CAC 
CACCAGATAT GAC C TAGT T T TATAT TT CTGGGAGGAAAT GAATTCATAT CTAGAAGT CT GGAGTG 
AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 
CTATC T T CAAAGACAT AT TAGAAGT TGGGAAAATAAT T CATGTGAACT AGACAAGTGTGT TAAGA 
GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACC T GGGGAGT GAGAGGACAGGATAGTGCATGT TC T T TGTCTCT GAAT TT T TAGT TATAT GT GC 
TGTAATGTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
TCT T CC CAACTGACAAAT GC CAAAGT T GAGAAAAATGATCATAAT T T TAGCATAAACAGAGCAGT 
CGGGGACAC CGAT T T TATAAATAAAC T GAGCACC T TCT T T T TAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 

MASLGQILFWSI ISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS 

DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC 

YIITSKGKGNANLEYKTGAFSMPEVOTDYNASSETLRCEAPRWFPQPTWWASQVDQGANFSEVS 

NTSFELNSEIWTMKWSVLYOTTINNTY 

CVSSFFAISWALLPLSPYLMLK 



Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 



pi N-myristoylation sites. 

ffi amino acids 52-58, 126-132, 188-194 
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FIGURE 61 



TGACGTCAGAATCAGCATGGCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGAC T C AGAT C AC AGT G GC TATAT CT C CAT GAAGGAGC T AAAGCAGGCC CT GGT CAAC T G CA 
AT TGGTCT TCAT TCAATGAT GAGACCT GCCT CAT GATGAT AAACATGT T T GACAAGACCAAGT CA 
CI GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT 
§1 CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTGTGTCCC 
Ml AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT 
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA 
GGCCT T CCGGGAGAAGGACACAGC TGTACAAGGCAACAT CCGGCT CAGCTT C GAGGAC T TCGTCA 
CCATGACAGCTTCTCGGATGCT ATGA CCCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT 
TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC 
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA 
AATAGT GAGGACCGGGGCT GAGGCCACACAGATAGGGGCCT GATGGAGGAGAGGATAGAAGT T GA 
m ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG 
f|| AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCATGAGAGAGATCTGTCTATGGGACCAGTGGCTTGGATTCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
ATTTGGGGCCAAAAGTCCAGTGAAATTGTAAGCTTCAATAAAAGGATGAAACTCTGA 
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FIGURE 62 

MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISMKELKQALWCNWSSFNDETC1MMI1M 

DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK 
DTAVQGNI RLS FEDFVTMTASRML , 



Important features of the protein: 
Signal peptide: 

amino acids 1-19 

W N-glycosylation site. 

B amino acids 147-150 

p« 

!8iS 
T" 

« Casein kinase II phosphorylation sites. 

fcl amino acids 135-138, 150-153, 202-205, 271-274 

tfi 

N-myristoylation sites. 

SE amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61- 

jfU 66, 70-75, 78-83, 83-88, 87-92, 110-115 
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FIGURE 63 



CAGG ATGC AGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC 
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
111 TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
ATC T GGGGTAT CAATAATCAACCCCCAGGT ACCAGCT GGGGAAATAT T AAT CGGTATCCAGGAGG 
CAGC TGGGGAAATAT TAAT CGGTATC CAGGAGGCAGC T GGGGGAATAT TAAT CGG TATCCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
Q CGTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
If! GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
J*j TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 

S T GCT CATC T C CAAT AAAATAAAAGC AC T T AT GAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Q- 

fffS AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 64 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGWEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL 
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
%l Signal peptide: 

j»5 amino acids 1-26 

pi 

fe| Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myristoylation sites. 

£| amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 

HI 259-264, 269-274, 270-275, 280-285, 281-286, 305-310 



mi 
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FIGURE 6s 



p 



4* 



m 



AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCGCAGGAGCGCAGTGGCCACT ATG GGGTC 
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACAC ATGAC AGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARS QHHWCNT 



Important features : 
Signal peptide: 

amino acids 1-19 

N-glycosylation site. 

fi amino acids 37-41 



ru 



N-myristoylation sites. 

amino acids 15-21, 19-25, 60-66 
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FIGURE 67 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAMCC.ee 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATGGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCACCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTCTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA 
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT 
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA 
GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA 
AGAC TCTGCCCCTGAGGTCAAGAGAGGAT GGGGCTAT TCACTTTTATATAT TTATATAAAAT TAG 
TAGT GAGAT GTAAAAAAAAAAAAAAAAAA 
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FIGURE 68 



MANPGLGLLLALGLPFLLARWGRAWGQI QTTSANENS TVLPSS TS SS SDGNLRPEAI TAI IWFS 
LLAALLLAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI 

Important features: 
Signal peptide: 

amino acids 1-19 



Transmembrane domain: 

f ; i amino acids 56-80 

W 



N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 86-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites. 

amino acids 7-13, 26-32 
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FIGURE 6q 



GCCAGGAATAACTAGAGAGGAAC AATGG G GTT ATT CAGAGGTTTT GTTT T C CT CTTAGT T CTGTGC CTG CTGCACCAG 
T CAAATACT T CCT T CATTAAGCT GAATAAT AAT GGCTT T GAAGATATT GTCATT GTTATAGAT CCTAGT GT GC CAGAA 
GAT GAAAAAATAATT GAACAAATAGAGGAT AT GGT GACT ACAGCTT CTAC GTAC CT GTT T GAAGCCACAGAAAAAAGA 
T TTTTTTT CAAAAAT GTATCT ATATTAATT C CT GAGAATT G GAAGGAAAAT CCT CAGTACAAAAGGCCAAAACAT GAA 
AAC CATAAACATGCT GAT GTT AT AGT T GCAC CACCTACACT CC CAGGT AGAGAT GAACCATACAC CAAGCAGTTCACA 
GAATGT GGAGAGAAAGGCGAATACATTCACTTCAC C CCT GACCT T CT ACTT GGAAAAAAACAAAATGAAT AT GGACCA 
CCAGGCAAACT GTTT GTCCAT GAGT GGGCT CACCTC CGGT GGGGAGT GTTT GAT GAGTACAAT GAAGAT CAGC C TTTC 
TACCGT GCTAAGT CAAAAAAAAT C GAAGCAACAAGGT GTT CCGCAGGTAT CTC T GGTAGAAATAGAGTT TATAAGT GT 
CAAGGAGGCAGCT GT CTTAGTAGAGCAT GCAGAAT T GATT CTACAACAAAACT GT ATGGAAAAGATTGT CAAT T CTTT 
CCT GATAAAGTACAAACAGAAAAAGCAT CCATAAT GTT TAT GCAAAGTATT GATTC TGT T GTT GAATT TTGTAACGAA 
AAAAC CCATAATCAAGAAGCT CCAAGCCT ACAAAACAT AAAGT G CAATT TTAGAAGTACAT GGGAGGT GATTAGCAAT 
T CT GAGGATT TTAAAAACAC CATACCCATG GT GACAC CACCT CC T C CACCT GT CTT CTCATTGCT GAAGAT CAGT CAA 
AGAATT GT GT GCTT AGTT CT T GATAAGT CT GGAAGCAT GGGGG GTAAGGAC CGCCTAAAT CGAAT GAAT CAAGCAGCA 
AAACATTT CCT GCT G CAGACTGT T GAAAAT GGAT CCTGGGT GGGGATGGTT CACTT TGAT AGTACT GCCACTATTGTA 
AATAAGCTAAT CCAAATAAAAAGCAGTGAT GAAAGAAACACACT CATGGCAGGATTAC CTACATATCCT CT GGGAGGA 
ACTTCCATCTGCTCTGGAATTAAATATGCATTTCAGGTGATTGGAGAGCTACATTCCCAACTCGATGGATCCGAAGTA 
CTGCT GCT GACTGAT GGGGAGGATAACACTGCAAGTTC T TGTATT GAT GAAGT GAAACAAAGT GGGGCCAT TGTT CAT 
TTTATT GCTTT GGGAAGAGCT GCT GAT GAAGC AGT AAT AGAGAT GAGC AAGAT AACAGGAGGAAGT CATTT TTAT GTT 
TC^GATGAAGCTCAGAACAATGGCCTCATTGATGCTTTTGGGGCTCTTAOTCAGGAAATACTGATCTCTCCC^GAAG 
T CC CTT CAGCT CGAAAGTAAGGGATTAACACT GAAT AGT AATGCCT GGAT GAACGACACTGT CATAATT GATAGTACA 
GT GGGAAAGGACAC GTT CTTT CT CAT CACATGGAACAGT CT GC CTC CCAGTATT T CTCT CT GGGAT C CCAGTGGAACA 
ATAATGGAAAATTT CACAGT GGAT GCAACT T CCAAAAT GGC CTAT CTCAGTATT CCAGGAACT GCAAAGGT GG GCACT 
TGGGCATACAATCTTCAAGCCAAAGCGAACCCAGAAACATTAACT^ 

GTGCCT CCAATCACAGTGAAT GCTAAAAT GAATAAGGAC GTAAACAGTTTC CC CAGCCCAAT GATT GTTTAC GCAGAA 
ATT CTACAAGGATATGTACCT GTT CTTGGAGCCAAT GT GACT GCTT TCATT GAAT CACAGAAT GGACATACAGAAGTT 
TTGGAACTTTT GGATAATGGT GCAGGCGCT GATT CT TT CAAGAATGAT GGAGT CT ACT CCAG GTATTT TACAGCATAT 
ACAGAAAAT GGCAGATATAGCTTAAAAGTT C GGG CT CAT GGAGGAGCAAACACT GC CAG GCTAAAATTACGGCCTC CA 
CTGAATAGAGCCGCGTACATAC CAGG CT GGGTAGTGAAC GGGGAAATT GAAGCAAACC C GCCAAGACCT GAAATTGAT 
GAGGATACT CAGACCACCTTGGAGGATTT CAGCC GAACAGCAT C CGGAGGT GCATT TGT GGTAT CACAAGT CCCAAGC 
CTTCCCTTGCCTGACCAATACCCACCAAGTCAAATCACAGACCTTGATGCCACAGTTCATGAGGATAAGATTATTCTT 
ACAT GGACAGCACCAGGAGATAATTTTGATGTT GGAAAAGTT GAACGTTATAT CATAAGAATAAGT GCAAGTATTCTT 
GATCTAAGAGACAGTTTTGATGATGCTCTTCAAGTAAATACTACTGATCTGTCACCAAAGGAGGCCAACTCCAAGGAA 
AGCT TT GC ATTTAAACCAGAAAATAT CT CAGAAGAAAAT GCAAC CCACATATTT AT TGC CATTAAAAGTATAGATAAA 
AG CAATTT GACAT CAAAAGTAT CCAACATT GCACAAGTAACTTT GTTTAT CCCTCAAGCAAAT CCT GAT GACATTGAT 
CCTACACCT ACT CCTACT CCT ACT CCTACTCCTGATAAAAGT CAT AAT TCT GGAGT TAATATT TCTACGCT GGTATT G 
T CT GTGATT GGGT CTGTT GTAAT T GT T AACTTTATTT TAAGT ACCAC CATT TGAA CCTTAAC GAAGAAAAAAATCTT C 
AAGTAGACCTAGAAGAGAGTT T T AAAAAACAAAACAAT GTAAGTAAAGGATATT TCTGAAT CTTAAAAT TCAT C CCAT 
GT GT GATCATAAAC TCATAAAAATAATTTTAAGATGT C GGAAAAGGAT ACTT T GATTAAATAAAAACACTCAT GGATA 
T GTAAAAACTGT CAAGATTAAAATTTAATAGTT TCATT TATTT GTT ATT TTATTTGTAAGAAATAGT GATGAACAAAG 
ATCCTTTTTCATACTGATACCTGGTTGTATATTATTTGATGCAACAGTTTTCTGAAATGATATTTCAAATTGCATCAA 
GAAATTAAAAT CAT CTAT CTGAGTAGT CIAAAATACAAGT AAAGGAGAGCAAAT AAACAACATTT GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 70 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKAS IMFMQSIDSWEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKH FLLQTVENGS WVGMVHFDS TAT IVNKL IQIKSS DERNTLMAGLPT YPLGGT SICSGIK 
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVI.EMSKITGG 
SHFYVS DEAQNNGL I DAFGALTS GNTDLSQKS LQLESKGLTLNSNAWMNDTVI I DS TVGKDT FFL 
I TWNSLPPS ISLWDPSGT IMENFTVDATSKMAYLS I PGTAKVGTWAYNLQAKANPETLT I TVTSR 
AANSSVPPITWAKMNKDTOSFPSPMIVYAEILQGWPVLGANVTAFIESQNGHTEVLELLDNGA 
GADSFKNDGWSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYIPGWVVNGEIEANPP 
RPEIDEDTQTTLEDFSRTAS GGAFWSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN 
FDVGKVQRY I IRI S AS I LDLRDS FDDALQVNTTDLS PKEANSKE S FAFKPENI SEENATHI FI AI 
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSWI 
VNFILSTTI 

. Signal peptide: 

amino acids 1-21 

Putative transmembrane domains : 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 



FIGURE 71 



CTCCTTAGGT GGAAAC CCT GG GAGTAGAGT ACT GAC AGCAAAGACC GGGAAAGACCATAC GTCC CCGGGCAGGG GT GA 
CAACAGGT GT CAT CTTT TT GAT CT CGT GT GT GGCTGCCTTC CTATTTCAAGGAAAGACGCCAAGGTAATTT T GACCCA 
GAGGAGCAAT GAT GTAGCCAC CT C CTAACCTTC CCTTCTT GAACCC CCAGTTAT GCCAGGATTTACTAGAGAGT GT CA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GTGCTCAGAAGTGACTTGCCTGAGGGTGGACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAA 
GGCTGTGATGGGAATGAAGGTGAAAACTTGGAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAA 
AGTAGAGAAGCT GCT CT GTGT GGT GGTT AACT CCAAGAGGCAGAACT CGTT CTAGAAG GAAAT GGAT GCAAGCAGCT C 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GTTTTGCTGGTGCTCCTCTGCTGTGCTATCTCTGTCCTGTACATGTTGGCCTGCACCCCAAAAGGTGACGAGGAGCAG 
CT GGCACT GCCCAGGGC CAACAGC CC CAC GGGGAAGGAGGGGTACCAGGCCGT CCT TCAGGAGT GGGAGGAG CAGCAC 
C G CAACTACGTGAGC AGCCTGAAGCG GCAGAT C GCACAGCT CAAGGAGGAGCT GCAG GAGAGGAGT GAGCAG CT CAGG 
AATGGGCAGTAC CAAGC CAGCGAT GCT GCTGG C CT GGGT CTGGACAGGAGCCC CCCAGAGAAAACCCAGGCC GACCT C 
CT GGCCTT CCT GCACT C GCAG GT GGACAAGGCAGAGGT GAAT G CTGGC GTCAAGCTGGCCACAGAGTAT GCAGCAGT G 
C CTTTCGATAGCTTTACT CT AC AGAAGGT GTAC C AGCT G GAGACTGGCCTTACCCGCCACCC CGAGGAGAAGCCTGT G 
AGGAAGGACAAGC GGGAT GAGT T G GT GGAAGC CATT GAATCAGCCTT GGAGACCCT GAACAAT CCT GCAGAGAACAGC 
C CCAAT CACCGT CCTTACACGGCCTCT GATTT CAT AGAAGGGAT CT ACCGAACAGAAAGGGACAAAGGGACATT GT AT 
GAGCTCAC CTT CAAAGGGGAC C ACAAACAC GAATT CAAACGG CT CAT CTTAT TT CGACCATT CAGCC CCAT CAT GAAA 
GTGAAAAAT GAAAAGCT CAACAT GGCCAACACGCTTATCAATGTTAT CGTGCCTCTAGCAAAAAGGGTGGACAAGTT C 
C GGCAGTT CAT GCAGAATT TCAGGGAGAT GT GCATT GAGCAGGATGGGAGAGT CCATCT CACT GTTGTTTACTTTGGG 
AAAGAAGAAATAAATGAAGTCAAAGGAAT ACT T GAAAAC ACTT CCAAAGCT GC CAACTT CAGGAACTT TACCTT CAT C 
CAGCTGAATGGAGAATTTTCTCGGGGAAAGGGACTTGATGTTGGAGCCCGCTTCTGGAAGGGAAGCAACGTCCTTCTC 
TTTT TCT GTGAT GT GGACATCT ACTT CAC AT CT GAAT T C CT CAATACGT GTAGG CT GAATACACAGCCAGGGAAGAAG 
GTATTTTAT CCAGT TCTTTTCAGT CAGT ACAAT C CT G GCATAATATACGGCCAC CATGATGC AGTCC CT CCCTTGGAA 
CAGCAGCTGGTCATAAAGAAGGAAACTGGATTTTGGAGAGACTTTGGATTTGGGATGACGTGTCAGTATCGGTCAGAC 
TT CATCAATAT AGGTGG GT TT GAT CT GGACAT CAAAG GCT GGGG CGGAGAGGAT GTG CACCTTT AT CGCAAGTAT CT C 
CACAGCAACCT CATAGT GGTAC GGAC GCCTGT GCGAGGACT CTT CC ACCTCT GGCAT GAGAAGC GCT GCAT GGACGAG 
CT GACCCCCGAGCAGTACAAGAT GT GCAT GCAGT CCAAGG CCAT GAAC GAGGCAT CC CAC GGCCAGCTGGGC AT G CT G 
GT GTTCAGGCAC GAGATAGAGGCT CAC CTTCG CAAACAGAAACAGAAGACAAGTAGCAAAAAAACATGAA 
GAAGGATTGTGGGAGACA.CTTTTTCTTTCCTTTTGCAATTACTGAAAGTGGCTGCAACAGAGAAAAGACTTCCATAAA 
GGACGACAAAAGAATT GGACT GAT GGGT CAGAGAT GAGAAAGCCTCCGATTT CT CT CTGTT GGGCT T TTTACAACAGA 
AATCAAAATCTCCGCTTTGCCTGCAAAAGTAACCCAGTTGCACCCTGTGAAGTGTCTGACAAAGGCAGAATGCTTGTG 
AGAT TATAAG CCTAAT GGT GT GGAGGTTT TGATGGT GT TTACAATACACT GAGAC CTGT T GTTT T GT GTGCT CAT TGA 
AATATT CAT GATTT AAGAGCAGTTTT GTAAAAAATT CATTAG CAT GAAAGG CAAG CATAT TT CT C CT CATAT GAATGA 
GCCTAT CAGCAGGGCT CTAGT TTCTAG GAAT GCT AAAATAT CAGAAGGCAGGAGAGGAGATAGGCT TATTAT GATACT 
AGTGAGTACAT T AAGTAAAAT AAAAT GGACCAGAAAAGAAAAGAAACCATAAATAT CGTGT CATAT TTT CCC CAAGAT 
TAACCAAAAATAATCTGCTTATCTTTTTGGTTGTCCTTTTAACTGTCTCCGTTTTTTTCTTTTATTTAAAAATGCACT 
TTTTTTCCCTTGTGAGTTATAGTCTGCTTATTTAATTACCACTTTGCAAGCCTTACAAGAGAGCACAAGTTGGCCTAC 
AT TTTTAT AT TTTT TAAGAAGAT ACTTTGAGAT G CATTAT GAGAACT TT CAGTT CAAAGCAT CAAATTGAT GCC ATAT 
C CAAG GACAT GC CAAAT GC TGATT CT GTCAGGCACTGAAT GT CAGGCATT GAGACATAGGGAAGGAATGGTT TGTAC T 
AAT ACAGAC GTACAGAT ACTT T CTCT GAAGAGTAT TTTCGAAGAGGAGCAACTGAACACT GGAGGAAAAGAAAATGAC 
ACTTT CT GCTT TACAGAAAAGGAAACT CATT CAGACT GGT GATAT C GTGAT GTACCTAAAAGTCAGAAACCACATT T T 
CT CCT CAGAAGTAGGGACCGCTTT CTT AC CT GTTTAAAT AAACCAAAGTATACCGT GTGAAC CAAACAAT CT CTTTT C 
AAAACAGGGT GCTCCT C CT GGCTT CT GGCTT CCAT AAGAAGAAAT GGAGAAAAATATATATAT ATATATATATATTGT 
GAAAGAT CAAT C CATCT GCCAGAAT CTAGT GGGAT GGAAGTT TTT GCT ACATGTTAT CCACCCCAGGCCAGGTG GAAG 
TAACT GAAT TATTT TTTAAATTAAG CAGTT CTAC T CAAT CACCAAGATGCT T CT GAAAATTGCATTTTATTACCATTT 
CAAACTATT TT TTAAAAAT AAATACAGTTAACAT AGAGT GGTTT CTT CATT CAT GTGAAAAT TAT TAGCCAGCACCAG 
AT GCAT GAGCT AATTAT CT CT TT GAGT CCT T GCT T CT GTT T GCT CACAGTAAACT CATTGTT TAAAAGCTT CAAGAAC 
AT T CAAGCTGT T GGT GT GTTAAAAAAT GCATT GT ATT GAT T T GTACTGGTAGTT TAT GAAATTTAATTAAAACACAGG 
C CAT GAATGGAAGGTGGTATT GCACAGCTAAT AAAATAT GAT TT GT GGATAT GAA 



FIGURE 72 

MMMVRRGLLAWISRVWLLVLLCCAISVLYMLACTPKGDEEQLALPR^ 

EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLIAFLHSQVDK 
AEVNAGVKLATEYAAVP FDS FTLQKVYQLE T GLTRHPEEKPVRKDKRDELVEAI E SALE TLNNPA 
ENS PNHRPYTAS DFIEGI YRTERDKGTLYELT FKGDHKHE FKRL I LFRP FS P IMKVKNEKLNMAN 
TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTVWFGKEEINEVKGILENTSKAANFRNF 
T F I QLNGE FSRGKGLDVGARFWKGSNVLLFFCDVDI YFT SE FLNTCRLNT QPGKKVFYPVL FSQY 
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR 
KYLHSNLIVVRTPWGLFHLWHEKRCMDEL^ 
RKQKQKTS SKKT 



P| Important features: 

Signal peptide: 



amino acids 1-27 



p* N-glycosylation sites. 

|l| amino acids 315-319, 324-328 



N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site* 

amino acids 377-381 



APP ID-10063714 
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FIGURE 73 

GAGACTGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGCAGCAAGAGAT T T GTCC TGGGGATCCA 
GAAAC CCAT GAT ACCC TACT GAACACCGAAT CC C CTGGAAGCCCACAGAGACAGAGACAGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAATGCCCAGTCGCCCATCGATATTCAGACAGACAGTGTGACATTTGACCCTGATTTGCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGCTCCACCTGCACTGGGGTCAGAAAGGATCCCCAGGGGGGTCAGAACACCAGATCAACAG 
T GAAGCCACATT TGCAGAGC TCCACAT TGTACAT TAT GACT C T GAT TCCT AT GACAGCT TGAGTG 
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG 
AAT AT AG C T TAT G AAC AC AT T C T GAG T C AC T T G CAT G AAG T C AGGC AT AAAG AT C AGAAGAC C T C 
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG 
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG 
AT T TCAATGGAACAGCT GGAAAAGCT T CAGGGGACAT T GT T C TC CACAGAAGAGGAGCCCTCTAA 
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA 
TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGTGTGGTCTTCACCTCAGCACAAGCCACGACTGAGGCATAAATTCCTTCTCAGATAC 
CATGGAT GT GGAT GAC T T CCC T T CATGCC TATC AGGAAGC C TC TAAAAT GGGG T GT AGGAT C T GG 
CCAGAAACACTGTAGGAG TAG TAAGCAGATGTCC TCCT T CCCC TGGACAT CT C TTAGAGAGGAAT 
GGACCCAGGCTGTCATTCCAGGAAGAACTGCAGAGCCTTCAGCCTCTCCAAACATGTAGGAGGAA 
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
CTCCTTAGGATAAAGAGTTGC TGTTGAAGTTGTATATTT TTGATCAATATATT TGGAAAT TAAAG 
TTTCTGACTTT 



FIGURE 74. 

MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV 
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS 

WFT S AQAT TEA 

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryotic-type carbonic anhydrases proteins 

amino acids 197-245, 104-140, 22-69 



APP ID=10063714 
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FIGURE 7S 



TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACAT CACCT TCTTAT CCATCAACATGAAGAATGT CCTACA 
ATGGACTCCACCAGAGGGT CTTCAAGGAGT TAAAGT TACT TACACTGTGCAGTATT TCATCACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCT TCC TGTT TCCATGCAACAAATATACTCCAATCTGA 
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
5f AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
p| CAGAGTTCAAGGCTAAAATCATCTTCTGGTATGTTTTGCCCATATCTATTACCGTGTTTCTTTTT 
fjji TC T GT GAT GG GC TAT T C CAT C T AC CGAT AT AT C C ACG T T G G C AAAGAGAAACACC C AGCAAAT T T 

pj GATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA 
\* ACTTTATCACCCTCAATATCTCGGATGATTCTAAAATTTCTCATCAGGATATGAGTTTACTGGGA 
f !! AAAAGCAGT GATGTATCCAGCCT TAAT GATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA 

GGAAGAGGAGGTGAAACAT T T AGGGTAT GCT TCGCAT T T GAT GGAAAT T TT TTGT GACT CT GAAG 
rl AAAA(^CGGAAGGTACTTCTCTCACCCA.GCAAGAGTCCCTCAGCAGAAC^ATACCCCCGGATAAA 
|l| ACAGTCAT TGAATATGAATATGATGTCAGAACCACTGACATT TGTGCGGGGCCTGAAGAGCAGGA 

GCTCAGTTTGCAGGAGGAGGTGTCCACACAAGGAACAT TAT TGGAGTCGCAGGCAGCGTT GGCAG 
TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG 
CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCC T T CTGAGGGGGAT GGGC TCGGAGAGGAGGGT C T T CTATCTAGACT CT AT GAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAATGAAACC TATCT CAT GCAATT CATGGAGGAAT GGGGGTT ATA 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 



2 C « 
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FIGURE 76 



MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISWLTAPEKWKRNPEDLPVSMQQIY 
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK 

DQS S E FKAKI I FWYVLP I S I T VFLFS VMG YS I YRY I HVGKEKHPAN L I L I YGNE FDKRFFVPAEK 
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS 
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
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FIGURE 77 

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGCCACCCT CAGT C CCAC TGCAG T T CT CAT CC TCGGC CCAAAAGTCAT CAAAGAAAAGC T GAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGCATGGACACCAGTGCAAGTGGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
CCAATCTAGTGAAAAACCAGCTGTGTCCCGTGATCGAGGCTTCCTTCAATGGCATGTATGCAGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
TGATCAATGAAAAGGCTGCAGATAAGCTGGGATCTACCCAGATCGTGAAGATCCTAACTCAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
TT TACAGCAAAGGTGACCAAC TTATACT CAAC T T GAAT AACAT CAGC TC T GATCGGATCCAGCTG 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGGTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCA GTGAA GACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 
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FIGURE 78 



MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK 
PAGGI PVLGSLVNTVLKHI IWLKVITANI LQLQVKPSANDQELLVKI PLDMVAGFNTPLVKT IVE 
FHMTTEAQATIMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLWAl^QVMNLLVPSLPN^ 
VKNQLCPVIEAS FNGMYADLLQLVKVP ISLSI DRLEFDLLYPAI KGDT IQLYLGAKLLDSQGKVT 
KWFNNSAASLTMPTLDNIPFSLIVSQDWKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN 
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT 
KGDQLILNLNNISSDRIQLMNSGIGWFQPDVLKNI ITEI IHSILLPNQNGKLRSGVPVSLVKALG 
FEAAE S S LTKDALVLT PAS LWKP S S PVS Q 

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site, 
amino acids 412-415 

LBP / BPI / CETP family proteins. 

amino acids 407-457 
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FIGURE 7Q 

GAGAGAAG T CAGC C T GG C AGAG AGAC T C T G AAAT GAG G GAT T AGAG G T G T T C AAGGAG C AAG AGC 

T T CAGCCTGAAGACAAGGGAGCAGTCC C T GAAGACGCT TCT AC TGAGAGGT CTGCCATGGCCT CT 

CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 

GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 

CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 

ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 

CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 

GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGGGATTC 

ATTGCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 

CATGAAATTTGAGATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAG 

CTGGAATCATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 

CAAGCCCAACCTCTTGCCACAAGGAGCTCTCGAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 

GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 

GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 

GTCAGAAGGTGCTGCT GAGGATAGACTGACTT TGGCCAT T GGATTGAGCAAAGGCAGAAATGGGG 

GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 

TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 

GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 

TCCCACTGACTGACCCTCTGTGATCAAAGACCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCATT 

GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 

CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 

GAC TAAT T T GT GCATGAACT GAAAT AAAACCAT CC TAC GGT AT CCAGGGAAC AGAAAGC AGGAT G 

CAGGAT G GGAGGACAGGAAG G C AGC C T G G G AC AT T T AAAAAAAT A 



FIGURE 80 



MASLGLQLVGYILGLLGLLGTLVSMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 

I YS T LLGLPAD I QAAQAMMVTS S AI S S LAC 1 1 S WGMRCT VFCQE SRAKDRVAVAGGVFFI LGGL 
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYY 
DAYQAQPLATRS SPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 4 6-59 
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FIGURE 81 



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT 
pi GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC 
g! TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG 
W AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC 

ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC % 
'» AC TGCCAGACTCC T GCT GCCAAGCC TGCAAAGAT GAGGCAAGTGAGCAAT CGGATGAAGAGGACA 

-/ GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG 
Q AGAGGCCCGGG^CCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT 
W CAGACCCAAGGGAGCAGGCAGCACAAC T G T CAAGATCGT CC TGAAGGAGAAACATAAGAAAGCC T 

|;f GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC 
SI CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC 
flj CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
C T CGGACT TGGTGGAGATCTACCTC TGGAAG C TGGTAAAAGAT GAGGAAACT GAGGC TCAGAGAG 
GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCTTCCACTTGACTCAGATCAAGAAAGTCAG 
GAAGCAAGACT TCCAGAAAGAGGCACAG CACT T CCGACT GC T CGC TGGCCCCCACGAAGGT CACT 
GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 
ACATAACAAAGACC TAACAGT TGCAGATATGAGC TGT AT AAT T GT T G T TAT TATATATT AAT AAA 
T AAGAAG T T G CAT T AC C C T C AAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 82 



MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE 
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF 
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR 
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS 
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE 
I S S TRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVE I YLWKLVKDEETEAQRGEVPGPRPH 
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT 

Signal peptide: 

amino acids 1-25 
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FIGURE 83 

GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGT TTCCATGCC GT GAGGTC CATT CAC AGAAC AC ATCC ATGGCT CT C AT GCT CAGTTT GGT T CT GAGT C 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 
CC AGTTCT CT AGCGTGGT CC AC CTCT AC AGGGACGGGAAGGACC AGCC ATTT AT GC AGAT GCCACAGT ATC 
AAGG CAGG AC AAAACT GGTGAAGGAT TCT ATT GC GG AG GGGCGC ATCTCT CT GAGGCT GGAAAAC ATT ACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
ACAGGTGTCAGCACTGGGCTCAGTTCCTCTCATTTCCATCACGGGATATGTTGATAGAGACATCCAGCTAC 
TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
ACAGACT C CAGGAC AAAC AG AG AC ATGC ATGGCC TGTT TG ATGTGGAGAT CT CTCT GACCGTCC AAGAGAA 
CGCCGGGAGC AT AT CC TGTT CC ATGC GGCATGCT C ATCTGAGCCGAG AGGTGGAAT CC AGGGT AC AGAT AG 
GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA 
T T T T TT GGCAT TGT TGGACT GAAGAT TT T CTT CT CC AAAT TCC AGT GG AAAATCCAGGCGGAACT GGACT G 
GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 
AGACGGCT CACC CG AAGCT CTGCGTT TC T G ATCT GAAAAC TGT AACCC AT AGAAAAGCT CCCCAGGAGGT G 
CCT C AC TC TGAGAAG AGATT T ACAAGGAAGAGTGTGGT GGCT TCTC AGAGTT T CCAAGCAGGGAAAC ATT A 
CTGGGAGGTGGACGGAGGACACAATAAAAGGTGGCGCGTGGGAGTGTGCCGGGATGATGTGGACAGGAGGA 
AGGAGT ACGTGACT T T GT CT CCCG AT C ATGGGT ACTGGGTC CTC AGACTGAATGGAGAAC ATTT GT AT T T C 
AC AT TAAAT CCC CGT T TT AT CAGCGT CT T C CCCAGGACCC CACCT ACAAAAAT AGGGGT CTT CCT GGACT A 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCT T ATT GAGGCC CT AC ATT GAGT AT CCGTC CT AT AATGAGCAAAAT GGAACT CCCAT AGT CATC TGC 
CC AGT C ACCC AGGAAT CAGAGAAAGAGGCC TCTT GGC AAAGGGC CTCT GC AATCCCAGAGAC AAGC AAC AG 
TGAGT C CT CCTC AC AGGC AACC AC GCCC T T CCTCCCCAGGGGT GAAAT GTAGG ATGAATC AC AT CCCAC AT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACT GGCCTGTC C ACATGGGAGTC AGGT GT CATGGCTGCCCT GAGC TGGGAGGG AAGAAGG 
CT GAC ATT AC AT TT AGTTTGCT CT CACT CC AT C T GGCT AAGT GATCTT GAAAT ACC AC CT CT CAGGTGAAG 
AAC CGTCAGGAATT CCC ATCT C AC AGG CT GTGGT GT AG AT T AAGT AGACAAGGAAT GT GAAT AATGCT TAG 
AT CT TATT GATGAC AGAGTGT AT CCT AATGGT T T GT TC AT T AT ATT ACACTT TC AGT AAAAAAA 



FIGURE 8d 



MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSWH 
LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ 
VSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL 
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK 
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT 
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSY3SIEQNGT 




Signal peptide: 



PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM 



amino acids 1-17 



Transmembrane domain: 



amino acids 239-255 
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FIGURE 8s 

AACAGACGTTCCCTCGCGGCCCTGGCACCTCTAACCCCAGACATGCTGCTGCTGCTGCTGCCCCT 
GC TCTGGGGGAGGGAGAGGGC GGAAGGACAGACAAGTAAACT GC TGACGATGCAGAGTTCCGTGA 
CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 
TACCC T GGCC CAGTAG T T CATGGC TAC T GGT TCCGGGAAGGGGCCAATACAGACCAGGAT GC TCC 
AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 
GGGACCCACATACCAAGAAT TGCACCC TGAGCATCAGAGAT GCCAGAAGAAGTGATGCGGGGAGA 
TAC TT C T TT CGTATGGAGAAAGGAAGT AT AAAATGGAAT TATAAACAT CACCGGC TC TCT GT GAA 
TGTGACAGCCTTGACCCACAGGCCC7VACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 
AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 
GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 
GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 
ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 
GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 
GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 
GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 
CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 
GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 
CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 
TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 
AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 
CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 
AAGCCT T GGGACT CGC GGGGACAGGAGGCCAC TGACACCGAGTACT CGGAGATCAAGAT CCACAG 
ATGAGAAACT GCAGAGACTCACCCTGAT TGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 
GAGGCT GAT T C T TGTAGAAT TAACAGC CCT CAACGT GATGAGCTAT GATAACAC TATGAAT TATG 
TGCAGAGTGAAAAGCACACAGGCTTTAGAGTCAAAGTATCTCAAACCTGAATCCACACTGTGCCC 
TCCCTT T TATT T T T T T AACT AAAAGACAGACAAAT TCC TA 
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5-3 

amino acids 1-15 



FIGURE 86 

MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPVVHGYWFREG 
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLS IRDARRSDAGRYFFRMEKGS IKWNY 
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGWGGAGATALVFLS FCVI FVWRSCRKKSARPAAGVGDTGIE 
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 
YSEIKIHR 

Signal peptide: 



Transmembrane domain: 



% amino acids 351-370 



in 
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FIGURE 87 

AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAGTGCAGCTGAGACTCAGACAAGAT TACAATGAACCAACTCAGCT TCCTGCTGT TTC 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAATGGACCTGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGACCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC 
CAACTACAACACCTTTGGATCTGCAGAGGCGGCCACGAGCGATGACTACAAGAACCCTGGCTACT 
ACGACATCCAGGCCAAGGACCTGGGCATCTGGCACGTGCCCAATAAGTCCCCCATGCAGCACTGG 
AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT 
GTTTGGCAT CTACCAGAAATATCCAGT GAAATATGGAGAAGGAAAGTGT T GGAC TGACAACGGCC 
CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT 
GGCCAGCGGGAATTCACTGCGGGATTTGTTCAGTTCAGGGTATTTAATAACGAGAGAGCAGCCAA 
CGCCT TGTGT GC TGGAAT GAGGGT CAC CGGATGTAACAC TGAGCAT CACT GCATTGGTGGAGGAG 
GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT 
GGAACTCATGTTGGTTACAGCAGCAGCCGTGAGATAACTGAGGCAGCTGTGCTTCTATTCTATCG 
TTGAGAGTTTTGTGGGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAA 
CAACT TACC CAGTAGCTAGAATGT TAAT GGCAGAAGAGAAAACAATAAAT CATAT TGAC TCAAGA 
AAAAAA 



FIGURE 88 



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 
YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT 
SDDYKNPGYYD I QAKDLGI WHVPNKS PMQHWRNS S LLRYRT DTG FLQTLGHNL FG I YQKYPVKYG 
EGKCWTDNGPVIPWYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN 
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features: 
Signal peptide: 

amino acids 1-16 

N-glycosylation site, 

amino acids 163-167 

Glycosaminoglycan attachment sites. 

amino acids 74-78, 289-293 

N-myristoylation sites. 

amino acids 76-82/ 115-121, 124-130, 253-259, 292-298 
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FIGURE 8q 



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCC ATGG GCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACTGAAATGGCTTTATTCGTCACCGTCTTTGGGCTGAAAAAGAAACCCTTCTGATTACCTTCA 
f*-h TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
ffl GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
W TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
T TAAGAC T TATATACAGT T T TAGGGGACAATTAAAAAAAAAAAA 
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FIGURE qo 

MGRVSGLVPSRFLTLLAHLVWITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA 
VELAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV 
TVFGLKKKPF 

Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 
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FIGURE Ql 

CTGGGACCCCGAAAAGAGAAGGGGAGAGCGAGGGGACGAGAGCGGAGGAGGAAGATCCAACTGAC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCT CAGCCAAGGT GAAGAAAATC T T T GGC T GGGGCGAC T TCTACT CCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 
AGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG 
GAGTGGGAGAAGGTAGAACGGGGCCGCCGGACCTCGCTTTGCACCCACGACCCAGCCAAGATCTG 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
TCTACATCGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 
CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
AGAAAGGGTCCCAAGTGCTGGTCCCAACCTGAAGCTGTGGAGTGACTAGATCACAGGAGCACTGG 
AGGAGGAGT GGGC TCT CTGT GCAGCC T CACAGGGC T TTGCCACGGAGCCACAGAGAGAT GCTGGG 
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
CCTTGGTTCTTGCCATCCTGAGGAAAGATAGCAACAGGGAGGGGGAGATTTCATCAGTGTGGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CATGAGGCTGAAGTGGCAACCCTGGGGTCTTTGATGTCTTGACAGATTGACCATCTGTCTCCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACAC C CAT CC T TAAG C T AAGAC AGGAC GAT TGTGGTCCTCC C AC AC T AAG GC C AC AGC C CAT C 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAA.CCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 



FIGURE Q2 

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL 
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF 
SVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDP 
AKICSRDHAQSSATWSCSQPFKWCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
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FIGURE Q3 



CGGTGGCC ATGA CTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC 
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAGATGGACCAACACAGAAATATCTGCTGATCTTTGGAGCGTTTGTCTCTGTCTATATCCAAGAA 
AT G T TCCGAT T T GCATAT TATAAACTCT TAAAAAAAGCCAGT GAAGGT T TGAAGAGT ATAAACCC 
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT 
" fF . GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT 
p\ ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC 
|f! TCCTGACCCACCTGCTGGTGTCAGCCCAGACCTTCATAAGTTCTTATTATGGAATAAACCTGGCG 
W TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG 
AAGCCTGAAACTC TGCCTGCTCTGCCAAGACAAGAACT TTCT TCT T TACAACCAGCGCTCCAGAT 
AACCTCAGGGAACCAGCACTTCCCAAACCGCAGACTACATCTTTAGAGGAAGCACAACTGTGCCT 
T TT TC T GAAAATCCC TT TT TC TGGTGGAAT TGAGAAAGAAATAAAAC TAT GCAGATA 
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FIGURE Q4 

MTAAVFFGCAFI AFGPALALYVFT I AIEPLRI I FL I AGAFFWLVSLL I S SLVWFMARVI I DNKDG 
PTQKYLLI FGAFVSVYIQEMFRFAYYKLLKKASEGLKS INPGETAPSMRLLAYVSGLGFGIMSGV 
FSFWTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQT FI S S YYG INLASAFI I LVLMGTWAFIjAAGGS CRSLKLCLLCQDKNFLL YNQRS R 

Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium neurotransmitter symporter family protein 

amino acids 31-65 
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FIGURE Q5 

AA.TTTTTCACCAGAGTAAACTTGAGAAACCAACTGGACCTTGAGTATTGTACATTTTGCCTCGTG 
GACCCAAAGGTAGCAAT CT GAAACATGAGGAGTACGAT TCT AC TGT T T T GT CT TC TAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACACTACCAAACCAACAGCAGTCAAATCAGGTCTT TCCT TCT TTAAGTCTGATACCATTAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGCAGGAATGACACCTGGTAC 
CCAGACCCACCCATTGACCCTGGGAGGGTTGAATGTACAACAGCAACTGCACCCACATGTGTTAC 
CAATTTTTGTCACACAACTTGGAGCCCAGGGCACTATCCTAAGCTCAGAGGAATTGCCACAAATC 
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC 
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG 
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT 
GCAGGCATCCAAAGGAGCACACATGCCATCGAGGAAGCCACCACAGAATCAGCAAATGGAATTCA 
GTAAGCTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC 
AT T GAT TAT AT TAT GGAAT AGAT T GAGACACAT T GGAT AGT C T TAGAAGAAAT TAAT TCT TAAT T 
T ACCTGAAAATAT TCTT GAAAT T TCAGAAAATATGTTCT ATG T AGAGAAT CCCAAC T TT T AAAAA 
CAATAATTCAATGGATAAAT CTGTC T T T GAAATATAACATT AT GCT GCCT GGATGATAT GCATAT 
TAAAACATAT T TGGAAAAC T GGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE Q6 

MRSTILLFCLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAAGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE 
LPQ I FTS L 1 1 HS L FPGG I LPTSQAGANPDVQDGS L PAGGAGVNPATQGT PAGRL PT PSG 
T DDDFAVT T PAG I QRS THAI EEATTE SANG I Q 



ill 



Signal peptide: 

amino acids 1-16 
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FIGURE Q7 

GCTCAAGTGCCCTGCCTTGCCCCACCGAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 

CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATC 

GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 

AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 

CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 

GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 

GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 

AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 

AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 

GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC 

CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC 

AC CAGCC T T GAC CACGCCC T GGAGAGGACC TACCAGCT GTT GGTACAGGT CAAGGACATGGG TGA 

CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAG.TCTCGATCATAGAGAGCACCTGGGTGT 

CCC TAGAGCC TAT CC ACCT GGCAGAGAATCT CAAAGT CCT AT ACCCGCACCACAT GGCCCAGGTA 

CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA 

T GCAGAGGGAAACC TCTAC GT GACCAGAGAGC TGGACAGAGAAGCCCAGGCT GAGTACC TGC T CC 

AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG 

ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT 

CAGTCCACCAGGTAC T GAAG TGAC TAGACT GT CAGCAGAGGAT GCAGAT GCCCCCGGCT CCC CCA 

ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC 

CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT 

CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG 

TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT 

ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA 

CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG 

GCC TGGAT TGGGAGCCAGAC T C TGGGCAT GT TAGAC TCAGACTCTGCAAGAACC TCAGT TAT GAG 

GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC 

AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 

ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 

CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 

CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAG.GGCGCCCAGCCTGGGG 

ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 

TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 

GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 

GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 

GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 

CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 

CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 

AT TT TCACCCACTGGACCATGTCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGT GCCCCT 

GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG 

TCC CCT GGGAGAGAGCCCAGCACCCAAGATC CAGCAGGGGACAGGACAGAGTAGAAGCCCC T C CA 

TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 

TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 

CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 



FIGURE Q8 



MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT 
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI 
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG 
S T S LDHALERT YQLLVQVKDMGDQAS GHQATATVEVS I IESTWVSLEPIHLAENLKVLYPHHMAQ 
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL 
VMDENDNVPI CPPRDPTVS I PELS PPGTEVTRLSAEDADAPGS PNSHWYQLLSPEPEDGVEGRA 
FQVDPTSGSVTLGVLPLRAGQNILLLYLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG 
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY 
EAAPSHEWVWQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPISAPAGSFLLT 
IQPSDPISRTLRFSLWDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS 
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP 
REH 1 1 PVWS HNAQMWQLLTOVI VCRCNVE GQCMRKVGRMKGMP TKLSAVG I LVGTLVAI G I FL I 
L I FTHWTMSRKKDPDQPADSVPLKATV 

Signal peptide: 

amino acids 1-18 

Transmembrane domain: 

amino acids 762-784 
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FIGURE QQ 

GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AG TC CAAGAT TC T TCCCAGGAACACAAACGT AGGAGACC CACGCT CC T GGAAGCAC CAGC CT T TA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATC.TAGGACC C AGG C AT CTTGCTTTC C AG C C ACAAAG AG AC AGATGAAGAT G C AGAAAG G AAAT G 
TTCTCCTTATGTTTGGTCTACTATTGCATTTAGAAGCTGCAACAAATTCCAATGAGACTAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTGCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAGTCACCAAC TCT GAGTTCCATACAACCTCCAGTGGGATCAGCACAGCCACCAACTCT GAG 
T T CAGCACAGCGTCCAGTGGGAT CAGCATAGC CACCAAC TCTGAG T CCAGCACAACC T CCAGTGG 
GGCCAGCACAGCCACCAACTCTGAGTCCAGCACACCCTCCAGTGGGGCCAGCACAGTCACCAACT 

C T GGGT CCAGTGTGAC C TCCAGTGGAGCCAGCACTGCCACCAACT C TGAGT CCAGCACAGTGT CC 
AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC 
CAACT C T GACTCCAGCACAACCT CCAGT GGGGCTAGCACAGCCACCAACTCT GAGTCCAGCACAA 
CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT 
GCCACCAAC T CT GAGT CCAGCACAACCT CCAGTGGGGCCAGCACAGCCAC CAAC TCT GAG T CCAG 
AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA 
GCACAGCCACCAACTCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAG 
TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG 
GGC TAGCACAGCCACCAACT C TGAC TCCAGCACAACCT CCAG T GGGGCCGGCACAGCCAC CAACT 
C TGAGT C CAGCACAGT GT CCAGT GGGATCAGCACAGT C AC CAATT CTGAGT CCAGCACACCCT CC 
AGT GGGGCCAACACAGCCAC CAAC TCT GAGT CCAG TAC GACCTCCAGT GGGGCCAACACAGCCAC 
CAAC TCT GAGTCCAGCACAGT GT CCAGT GGGGCCAGCAC TGCCAC CAACT CT GAGTCCAGCACAA 
CC TCCAGT GGGGTCAGCACAGCCAC CAACT C T GAGTCCAGCACAAC CT CCAGT GGGGCTAGCACA 
GC CACCAAC T C TGACT CCAGCACAACCT CCAGTGAGGCCAGCACAGCCAC CAAC TC T GAGT C TAG 
CACAGT GTCCAGT GGGAT CAGCACAGT CACCAAT TCT GAGT CCAGCACAACC TCCAGTGGGGC CA 
ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA 
ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT 
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
GTGGAGTCCTAACTGGTTCTGGAGGAGACCAGTATCATCGATAGCCATGGAGATGAGCGGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AATC TT GAAGAAGGTAT T CC TCACC TT T C TT GCCT T TAC CAGACAC TGGAAAGAGAATACTATAT 
TGCTCAT TTAGCTAAGAAATAAATACATCTCATCTAACACACACGACAAAGAGAAGCTGT GCT TG 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CAT C TGGCAT TCAAAAT CTCCACAGTAAAAT CCAAAGACC TCAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE lOO 



MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS 
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSS'GASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES 
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSWA 
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI 
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 



APPJD=10063714 



Page 254 



FIGURE 101 



GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 

CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 

GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 

GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 

AACGAGGCAGTTTCTAGCTACTGCACGTACTTCATAAAGCAGGACTCTAAAAGCTTTGGAATCAT 

GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 

TCATGCT GAG TCCC TT T T TACC T T TGATGTT T GTAAACCCAT CT TGGTATCGC TGGATCAACAAC 

CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 

GATTATAACTGGGGATGCATT TGTTCCT GGAGAAAGAAGTGTCAT TATCATGAACCATCGGACAA 

GAATGGAGTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 

TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 

C TT CAT T CAT AGGAAATGGAAGGAT GACAAGAGCCAT T T CGAAGACAT GAT T GAT TACT T T T G TG 

ATAT TCACGAACCAC T T CAAC TCC TCAT AT T CCCAGAAGGGAC TGATCT CACAGAAAACAGCAAG 

TCTCGAAGTAATGCATTTGCTGAAAAAAATGGACTTCAGAAATATGAATATGTTTTACATCCAAG 

AACTACAGGCTTTACTTTTGTGGTAGACCGTCTAAGAGAAGGTAAGAACCTTGATGCTGTCCATG 

ATATCACTGTGGCGTATCCT CACAACATT CCTCAATCAGAGAAGCACCTCCTCCAAGGAGACT TT 

CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 

CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 

GGGAGAAGAATTTTTATTTTACCGGACAGAGTGTCATTCCACCTTGCAAGTCTGAACTCAGGGTC 

CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 

ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 

GAATAT T T GG TGGAC T GGAGAT CATAGAAC T TGCAT GT T ACCGAC T T T TACACAAACAGCCACAT 

TTAAAT TCAAAGAAAAATGAGTAAGATT ATAAGGTT TGCCAT GTGAAAACCTAGAGCATATT TTG 

GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 

GCCATCATTATTTGTTAAAGATATTTTGCACTTAATTTTGTGGGAAAAATATTGCTACAATTTTT 

T TTAATCTCTGAATGTAATT T C GAT AC T GTGTACATAGCAGGGAGTGAT CGGGGTGAAATAACT T 

GGGCCAGAATAT TAT T AAAC AAT CAT C AGG C T T T T AAA 



FIGURE 102 



MHSRGREIWLLNPWSINEAVSSYCTYFIKQDSKS FGIMVSWKGIYFILTLFWGSFFGSIFMLSP 
FLPIJYIFVNPSWYRWINNRLVATWLTLPVALL^^ 

FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP 
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFWDRLREGKNLDAVHDITVA 
YPHNIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF 
Y FTGQSVI PPCKSELRVLWKLLS I L YWTLFS PAMCLL I YLYS L VKWY FI I T I VI FVLQERI FGG 
LEI IELACYRLLHKQPHLNSKKNE 

jassij 
El 

m Important features of the protein: 

'8:3$ 

||! Signal peptide: 

PI amino acids 1-22 

Transmembrane domains: 

amino acids 44-63, 90-108, 354-377 



Co 



APP_ID=1 00637 14 



Page 256 of 322 



FIGURE 1Q3 



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCT CT ACAGAGATGGGGAAGAC TGGGAAT CTAAGCAGAT GCCACAGTAT CGAGGGAGAACTGA 
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG 
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG 
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA 
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC 
AGGATT T GT CT TCAGAC TCCAGAGCAAATGCAGATGGGTACAGCC TGTATGATGTGGAGATC T CC 
ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA 
GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA 
TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC 
AAATC CAAAGGGAAAAT C CAGGCGGAACTGGAC TGGAGAAGAAAGCAC GGACAGGCAGAAT T GAG 
AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG 
T T TC T GATCT GAAAACTGTAACCCATAGAAAAGC T CCCCAGGAGGT GCCTCAC TCTGAGAAGAGA 
T T TACAAGGAAGAGTGT GGT GGC T T CT CAGGGT T T CCAAGCAGGGAGACAT TACTGGGAGG T GGA 
CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA 
ATGTGACT T T GTC TCCCAACAAT GGGTATT GGGT CC TCAGAC TGACAACAGAACAT T TGTATT T C 
ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT 
GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC 
TGACAT GTCAGT TTGAAGGCTTGT TGAGACCCTATATCCAGCATGCGATGTATGACGAGGAAAAG 
GGGACT CCCATAT T CATATGTCCAG TG T CCT GGGGATGAGACAGAGAAGACCC TGC T TAAAGGGC 
CCCACAC CACAGAC CCAGACACAGCCAAGGGAGAG TGC TCCCGACAGGT GGCCCCAGCT T CCT C T 
CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC 
TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG 
TCAGAAGCCAT GGC TGCCCT GAAGT GGGGACGGAATAGACT CACAT TAGGTT TAGT TTGT GAAAA 
CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAGTGAT T CC TGCC T CACAGGTGAAGAT T AAAGAGACAACGAAT GTGAATCAT GC T TGCAGGT T 
TGAGGGCACAGTGT T TGC T AAT GAT GT GT T T T TATAT TATACAT T T T CCCACCATAAAC T C T GT T 
TGC T TAT TCCACAT TAAT TT AC TT T TCT CTATACCAAATCACCCAT GGAATAG T TAT TGAACACC 
TGCTTTGT GAGGC TCAAAGAAT AAAGAGGAGGTAGGAT T T TT CACTGAT T CTATAAGCCCAGCAT 
TACC TGATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGT CCAT AT CC 
C TCATTAACACAGACACAAAAATTC TAAATAAAATTTTAACAAATTAAACTAAACAATATATTTA 
AAGAT GATATATAAC T AC TCAGTGT GGT T TGT CCCACAAATGCAGAGT T GGTT TAATAT T TAAAT 
ATCAACCAGT GTAAT T CAGCACAT TAATAAAGT AAAAAAGAAAACCATAAAAAAAAAAAAAAA 



FIGURE lOd 

MAFVL I LVL S FYELVS GQWQVTG PGK FVQALVGEDAVFS CS L FPE T SAEAMEVRFFRNQ FHAWH 
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR 
VAALGSLPLIS I VGYVDGGI QLLCLS SGWFPQPTAKWKGPQGQDLS SDSRANADGYSLYDVE I S I 
I VQENAGS I LCS I HLAEQSHE VE SKVL I GE T FFQPS PWRLAS I LLGLLCGALCGWMGMI I VFFK 
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF 
TRKSWASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLSPNNGYWVLRLTTEHLYFT 
FNPHFI SLPPS TPPTRVGVFLDYEGGT I S FFNTNDQSL I YT LLTCQFEGLLRPYI QHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
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FIGURE IPS 

CCTT C AC AGG ACTCTT CATTGCTGGT TG GC A ATGA TGT AT CGGCCAGATGTGGT G AGG GCT AGG AAAAGAG 
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CTCACTGTTCATTATGTGAGATATAATCAAAAGAAGACCTACAATTACTATAGCACATTGTCATTTACAAC 
TG AC AAACT AT ATGCT GAGT TT GGCAGAGAGGCTT C TAAC AAT T T TAC AG AAAT GAGC CAGAGACT TGAAT 
C AAT GGTG AAAAAT GC ATTTT AT AAATCT CC ATT AAGGGAAG AAT T TGTC AAGT C TC AGGT TAT CAAGTTC 
AGTC AACAGAAGC ATGGAGT GT T GGCTCAT AT GCTGTT GATT TGT AGATTTCACTCT ACT GAGGAT CCTG A 
AACT GT AG AT AAAATT GTTCAACTT GTT TT AC AT GAAAAGCT GCAAGATGC T GT AGG ACCCCCT AAAGT AG 
ATCCTCACTCAGTTAAAATTAAAAAAATCAACAAGACAGAAACAGACAGCTATCTAAACCATTGCTGCGGA 
AC ACGAAGAAGT AAAACTCT AG GT CAGAGTCT CAGG ATCGTT GGTGGGAC AG AAGT AG AAGAGGGT GAAT G 
GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC 
TT GT GAGT GC TGCT CACT GTTT T ACAAC AT AT AAGAAC CCTGCC AG ATGGACT GCTTCCT TT GGAGT AAC A 
AT AAAACCTT CG AAAATG AAACGGGGTCTCCGGAGAAT AATT GT CC AT GAAAAAT AC AAAC ACCCAT CAC A 
TG ACT ATG AT ATTT CT CT TGCAGAGC TTTCT AGC CCTGTT C CCT AC AC AAAT GC AGT AC AT AGAGT TTGTC 
TCCCTGAT GC AT CC T ATG AG TT TC AACC AGGTGATGTGAT GTTTGTGAC AGG AT TT GG AGCACT GAAAAAT 
GAT GGT T AC AGTCAAAAT C AT CTTCGAC AAGC AC AGGT GACTC T CAT AGACGCT AC AACT T GCAAT GAACC 
TC AAGCTT AC AATGACGCC AT AACTCCT AGAATGT TAT GT GCTGGCT CCT T AG AAGGAAAAAC AGATGC AT 
GCC AGGGTGACT CTGGAGGACC AC TGGT T AGTTC AGATGCT AGAGAT ATCTGGT AC CT TGCT GGAAT AGT G 
AGCT GGGGAGAT GAAT GT GCGAAACCCAAC AAGCCTGGTG TTTAT AC T AGAGT T ACGGCCTTGCGGGACT G 
GAT T AC TT CAAAAACT GGTATC TAAGAG AC AAAAGCCT C ATGGAAC AG AT AACATTTT T TTT TGTT TT TTG 
GGTGTGGAGGCC AT T T TT AGAGAT AC AGAATT GGAGAAGACT TGCAAAAC AGCT AG AT T TGACT GATCT CA 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GATC AACT C TGT CATC TGT GAGCAAT AGTT GAAACT TT AT GT AC AT AG AGAAAT AGAT AAT ACAAT AT TAC 
AT T AC AGC CT GT AT TC ATT T GT T C TCT AGAAGTT T T GT CAG AAT TTT GAC TT GTTG AC AT AAAT T T GT AAT 
GC AT AT AT AC AATT T GAAGC AC TCCT TTT C TT CAGTT CCT CAGCTCCTCT CAT TT C AGCAAATAT C CAT T T 
TC AAGGTGCAGAACAAGGAGTGAAAGAAAAT AT AAG AAGAAAAAAATCCC CT AC AT TT TATT GGCAC AGAA 
AAGT AT TAGGTGTT T TTC TT AGT GGAAT AT T AGAAATGAT CAT ATT CATT AT GAAAGG TC AAGC AAAG AC A 
GCAGAATACCAATCACTTCATCATTTAGGAAGTATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAG 
AT AT AT CCT TAT TT T C AT TT CC AAAC AACT ACT ATGAT AAAT GTGAAG AAGAT TCT GT T TTT TT GTGACCT 
AT AAT AAT TAT ACAAACT TC ATGC AAT GTACT TGTT CT AAGC AAATTAAAGC AAAT AT T T AT TT AACATT G 
TTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 
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FIGURE 106 

MMYRPDWRARKRVC WE PWVI GLVI FI SLIVLAVCIGLTVHYVRYNQKKTYNYYS TLS FTTDKLY 
AEFGREASNNFTEMSQRLESMVKNAFYKSPLRESFVKSQYIKFSQQKHGVLAHMLLICRFHSTED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TE VEEGE WPWQAS LQWDGSHRCGATL I NATWLVSAAHC FTT YKNPARWTAS FGVT I KPSKMKRGL 
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS 
QNHLRQAQVT L I DAT T CNE PQAYNDAI T PRMLCAG SLE GKT DACQGDS GGPLVS S DARD I WYLAG 
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI 



^ Transmembrane domain: 

III amino acids 21-40 (type II) 

till 
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FIGURE 107 



AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGT.GAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TCTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAATGT TC T CACT TTAACT GCAGAAACGAG GGTAGAGGAAGCAG T CAT T TT GAC T TACT T TCC T 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GATATGGTCACTTTGAAAGCCAGGATGACAAATTATGGATTACCTAGATATCGGTGGCTTACTCA 
N« TGCTTGGAATTTTTTTCAGAGAGAGTTTAAGTGCTGTGGAGTAGTATATTTCACTGACTGGTTGG 
CS AAATGACAGAGATGGACT GGCCCCCAGAT T CCTGC TGTGTTAGAGAATT CCCAGGAT GT T CCAAA 

Q CAGGCCCACCAGGAAGATCTCAGTGACCTTTATCAAGAGGGT TGT GGGAAGAAAAT GTAT TCCT T 

3| TT TGAGAGGAACCAAACAAC TGCAGGT GCTGAGGT T T CT GGGAAT C TCCAT T GGGG T GACACAAA 

f ! TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG 
Pi AC AGAC C AAAT GAT G T C C T T G AAG AAT G AC AAC T C T CAG C AC C T G T CAT G T C C C T C AGT AG AAC T 
St GTT GAAACCAAGCC TGT CAAGAAT CT T T GAACACACAT C CAT GGCAAACAGC T T TAATACACACT 

Uh T TGAGAT GGAGGAGT T ATAAAAAGAAAT GT CACAGAAGAAAAC CACAAACT T GT TT TAT T GGAC T 

ji- TGT GAAT T T T TGAGTACATAC TAT GTGT T T CAGAAATAT GTAGAAATAAAAAT GT TGCCATAAAA 

T AACAC CT AAGCATAT ACTAT T CTATGC T T TAAAATGAGGATGGAAAAG T T TCATGTC AT AAGT C 
* m ACCACC TGGACAATAAT T GATGCCCTTAAAATGCTGAAGACAGAT GTCATACC CACT GT G TAGCC 

CI TGTGTATGACT T T TACT GAACACAGTTATGT T T TGAGGCAGCATGGT T T GAT TAGCAT T T CCGCA 

H| TCCATGCAAACGAGT CACATATGGTGGGACT GGAGCCATAGT AAAGGT T GAT T TAC T TCT ACCAA 

PI C T AGT AT AT AAAGTACT AAT TAAATGCTAACATAGGAAGT TAGAAAAT AC TAATAAC TT T TAT TA 

C T CAGCGAT CT AT T CTT CTGATGC TAAATAAAT TAT AT AT CAGAAAACT T TCAATAT TGGT GAC T 
ACC TAAATGT GAT T TT T GCT GG TTAC T AAAATAT TC TTACCAC T TAAAAGAGCAAGC TAACACAT 
TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
T TCAGT T CT GATAATGTTAAGAATAACCAT TATGAAAAGGAAAAT T TGTCC TGTAT AGCAT CAT T 
ATT TTTAGCCTTTCCTGTTAATAAAGCT TTAC TATTCTGTCCTGGGCTTATATTACACATATAAC 
TGTTAT TT AAAT ACT T AAC CACTAAT TT TGAAAAT TACCAGTGTGATACATAGGAATCAT TATTC 
AGAATGTAGT CTGGTCTT TAGGAAGTAT TAATAAGAAAATTTGCACATAACTTAGTTGATTCAGA 
AAGGACTTGTATGCTGTTTTTCTCCCA^lATGAAGACTCTTTTTGACACTAAACACTTTTTAAAAA 
GCT TAT CT T T GCC T TCT C C AAACAAGAAG CAAT AG T C TC C AAG T C AAT AT AAAT T C T AC AG AAAA 
TAG TGT TCTTTTTCT CCAGAAAAATGCT T GT GAGAATCAT TAAAACAT GT GACAAT T TAGAGAT T 
CTTTGTTTTATTT CAC T GAT TAATATAC TGT GGCAAAT T ACAC AGAT TAT TAAAT t T T T T T ACAA 
GAG TAT AGT ATAT T TAT T T GAAATGGGAAAAGT GCAT T T TACT GTAT T TT G T GTAT T T T GTT TAT 
TTC TCAGAATATGGAAAGAAAAT TAAAAT GT GT CAATAAATAT T TT C T AGAGAG TAA 



Ms 
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FIGURE 108 



MAREDSVKCLRCLLYALNLLFWLMS I SVLAVSAWMRDYLNNVLTLTAETRVEEAVI LT Y FPWHP 
VMIAVCCFL 1 1 VGMLGYCGTVKRNLLLLAWYFGSLLVI FCVELACGVWT YEQELMVPVQWSDMVT 
LKAI^TNYGLPRYRWLTHAWNFFQREF^CCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ 
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM 
MSLKNDNSQHLSCPSVELLKPSLSRI FEHTSMANS FNTHFEMEEL 

Signal peptide: 

amino acids 1-33 

Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248 
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FIGURE lOQ 

CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GATAAAGTCTTTCTCGTGAGAGCCTAGAGGCCTTAAAAAAAAAAGTGCTTGAAAGAGAAGGGGACAAAGGAACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CC T GGAG GAAC AC T G ACC ACCAGT T GG AT GAGT CT C AAGGTC C T C CT CT AT GT GAC AAC CAT GT GAAT GGGGAG 
TGGTACCACTTCACGGGCATGGCGGGAGATGCCATGCCTACCTTCTGCATACCAGAAAACCACTGTGGAACCCA 
CGCACCTGTCTGGCT CAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
CTGACCAAGCCCAGCGTCTGCTTCCACGTCTACTGTGGTCATTTTTATGACATCTGCGACGAGGACTGCCATGG 
C AGC T GC T C AG AT AC C AGC GAGT GCACATGC GCTCCAGGAACT GTGCTAGGCCCT GACAGGCAGACATGCTTTG 
AT GAAAAT GAAT GTGAGCAAAAC AACGGT GGCT GC AGT GAGAT CT GT GT GAACCT C AAAAACTC CT ACC GCT GT 
GAGT GTGGGGT T GGCCGT GTGCT AAGAAGTGAT GGC AAGACT T GT GAAGAC GT T GAAGGAT GCC AC AATAAC AA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
CTGAGGAT AACCACACTT GCCAAGTCCCT GT GTT GTGCAAATC AAAT GCCATT GAAGT GAACAT CCCC AGGGAG 
CTGGTTGGTGGCCTGGAGCTCTTCCTGACCAACACCTCCTGCCGAGGAGTGTCCAACGGCACCCATGTCAACAT 
CCTCTTCTCTCT C AAGAC AT GT G GT AC AGT GGTC GAT GT GGT GAAT GAC AAGAT T GT GGC C AGC AAC C T C GT GA 
C AG GT CT ACC C AAGC AGAC C CC G GGGAGC AGC GGGGAC T T CAT CAT C C GAACC AGC AAGC T GCT GAT CCC GGTG 
ACCTGCGAGTTTCCACGCCTGTACACCATTTCTGAAGGATACGTTCCCAACCTTCGAAACTCCCCACTGGAAAT 
CAT GAGC C GAAAT CAT GGGATCT T CCC ATT C ACT CT GGAGAT C TT C AAGGAC AAT GAGT T T GAAGAGC CT T AC C 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TT G GAAAGCT T G GT GGAGAGC T GCTT T GCCACCCC CACC T CCAAGATC GACGAG GT C C T GAAAT ACT AC CTC AT 
CCGGGAT GGCT GT GT TTCAGAT GACTC GGT AAAGC AGT ACACATCCC GGGATC ACC T AGC AAAGCACTTCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGTCACAGCACTGCTGAACAATGTGGCCTGGGTGGGGTTTCATCTTTCTAGGGTTGAAAACTAAACTGTCCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CAC AAAATCAGAAGCTGGGT AT AAT AT TTCAAGTT ACAAACCCTAGAAAAAT TAAACAGTT ACT GAAATT AT GA 
C T T AAAT ACCC AAT G ACT C CTT AAAT AT GT AAAT TAT AGT T AT AC CT T GAAAT T TC AAT T CAAAT GC AGACT AA 
TTAT AGGGAAT TT GGAAGT GTATCAAT AAAACAGT AT AT AATT TT 



FIGURE no 



MP P FLLLT CL F I TGT S VS P VAL D PC SAY I S LN E P WRNT DH QL DE S QG P PLCDNH VNGE W Y H FT GMAGDAMP 
TFCIPENHCGTHAPWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV 
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV 
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAI EVNIPRELVGG 
LELFLTNTSCRGVSNGTHWILFSLKTCGTWDVWDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT 
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPVVHV 
SGLE SLVE SCFATPT SKI DEVLKY YLIRDGCVSDDSVKQYTSRDHLAKH FQVPVFKFVGKDHKE VFLHCRV 
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPI RI OWED 



f?* Important features of the protein: 

|S! Signal peptide: 

amino acids 1-16 

2 : 

N-glycosylation sites. 

amino acids 89-93, 116-120, 259-263, 291-295, 299-303 

■ 

TZ Tyrosine kinase phosphorylation sites. 

Hi 

. p| amino acids 411-418, 443-451 

w 

|3 N-myristoylation sites. 

f|| amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306, 

522-528, 531-537 

Aspartic acid and asparagine hydroxylation site. 

amino acids 197-209 

ZP domain proteins . 

amino acids 431-457 

Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
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FIGURE ill 



GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCCTGCACTCGG 
GCCTCCTCCAGC CAGT GCT GACC AGG GACT TC T G AC CT GC T GGC C AGCC AGG AC CTGT GT GGGG AG GCCCT 
CCTGCTGC CT T GGGGT GAC AAT CTCAGC TC C AGGCT AC AGGG AGACCGGGAGGATC AC AGAGCC AG CATGT 
TACAGGATCCTGACAGTGATCAACCTCTGAACAGCCTCGATGTCAAACCCCTGCGCAAACCCCGTATCCCC 
AT GG AGACCT T C AG AAAGGT GGGGAT C CCC AT C ATCAT AGCACT AC TGAGCC TG GCGAGT ATC ATC AT TGT 
GGTTGTCCTCATCAAGGTGATTCTGGATAAATACTACTTCCTCTGCGGGCAGCCTCTCCACTTCATCCCGA 
GG AAGC AGCTGTGT G ACGGAGAGCTGG ACT GT CCCT TGGGGGAGGACG AGGAGC ACT GTGTCAAGAGCTTC 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
AGGGAACTGGTTCTCTGCCTGTTTCGACAACTTCACAGAAGCTCTCGCTGAGACAGCCTGTAGGCAGATGG 
GCTACAGCAGAGCTGTGGAGATTGGCCCAGACCAGGATCTGGATGTTGTTGAAATCACAGAAAACAGCCAG 
•S GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
\ „ \ TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
%\ TCAGCAT C CAGT ACGACAAACAGC ACGT CT GT GGAGGGAGCATCCTGGACCC CC ACT G GGTCCTC ACGGCA 

*f»h GCCCACTGCTTCAGGAAACATACCGATGTGTTCAACTGGAAGGTGCGGGCAGGCTCAGACAAACTGGGCAG 
sfK CTTCCCATCCCTGGCTGTGGCCAAGATCATCATCATTGAATTCAACCCCATGTACCCCAAAGACAATGACA 
* TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
GATGAGGAGC T C ACTCCAGC CACCCC ACTC TGGATC AT TGGATGGGGCTT T ACGAAGC AG AAT GGAGGGAA 



tn 



GATGTCTG AC AT ACTGCTGC AG GCGT CAGTCC AGGT CATTGAC AGC AC ACGGT GCAAT GC AGACGATGCGT 
AC CAGGGGGAAGT C ACCGAG AAGATG ATGTGT GC AGGCAT CCCGGAAGGGGGT GTGGAC ACCTGCC AGGGT 
GAC AGTGGTGGGCCCCTGAT GT ACCAAT CT GACC AGTGGCAT GT GGTGGGCAT C GT T AGC TGGGGC TAT GG 
CTGCGGGGGCCCGAGCACCCCAGGAGTATACACCAAGGTCTCAGCCTATCTCAACTGGATCTACAATGTCT 
GGAAGGCTGAGCTGTAATGCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGGATCCCCCAAAGTCAGACACAGAGCAAGAGTCCCCTTGGGTACACCCCTCTGCCCACAGCCTCAGCAT 
TT CT TGGAGC AGCAAAGGGCCT CAAT TC CTGT AAGAGACC CTCGC AGCCC AGAGGC GCCC AGAGGAAGTC A 
GC AGCCCT AGCTCG GC CACACT TGGT GC TCCC AGC AT C CC AGGGAGAG ACACAGCCCACT GAAC AAGGTCT 
C AGGGGTATT GCTAAG CCAAGAAGGAACTTTCCC AC AC T ACTGAATGGAAGC AGGCTGTC T T GT AAAAG CC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GCCTACTAGAGCAAGAAACCAGTTGTAATATAAAATGCACTGCCCTACTGTTGGTATGACTACCGTTACCT 
AC TGT T GT CATT GT T ATT AC AGCT AT GGC C ACT AT TAT TAAAGAGCT GTGTAAC AT CTCT GGC AAAAAAAA 
AAAA 
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FIGURE 112 



MLQDPDSDQPLNSLDVKPLRKPRIPMETFRKVGIPIIIALLSLASIIIVWLIKVILDKYYFLCG 
QPLHFI PRKQLCDGELDCPLGEDEEHCVKS FPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN 
FTEALAETACRQMGYSRAVEIGPDQDLDWEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL 
KTPRWGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWLTAAHCFRKHTDVF1MKVIIAGSDKL 
GSFPSLAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG 
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS 
DQWHWGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL 

Transmembrane domain: 

amino acids 32-53 (typell) 
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FIGURE lift 

GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
T AGGTGTAAGCCACCGTGTCTGGCCT CT GAACAAC TT T T T CAGCAAC T AAAAAAGC CACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
N< CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
y AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
III ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
\| CAGAAGGGGT CTGC TT AGACCACC TGGT TT ATGTGACAGGAC TTGCAT T C TCC TGGAACAT GAGG 

H GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACT T GT GCGAAATTAT GGGTCAGAAAAGATG 

GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
% h CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
Ifl CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
Q TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
GGCTAATGGCTCAGTGTTGGCCCAGGAGGTCAGCAAGGCCTGAGAGCTGATCAGAAGGGCCTGCT 
GT GCGAACAC GGAAAT GCCT CCAGTAAGCACAGGC T GCAAAAT CCCCAGGCAAAGGACTGTGTGG 
CTCAATTTAAATCATGTTCTAGTAATTGGAGCTGTCCCCAAGACGAAAGGAGCTAGAGCTTGGTT 
CAAATGAT CT CCAAGGGCCCT TATACCCCAGGAGAC T T T GATT TGAATT T GAAACCC CAAATCCA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
C CAGCTAC TCGGGAGGGT GAGACAGGAGAATTACT T GAACCT GGGAGGTGAAGGAGGCTGAGACA 
GGAGAAT CAC T T C AGCC T GAGC AACACAG C GAGAC T C T G T CT C AGAAAAAAT AAAAAAAG AAT T A 
TGGTTATTTGTAA 



APP_ID=10063714 



Page 267 of 322 



FIGURE lid 

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC 
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD 

Signal peptide: 

amino acids 1-15 
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FIGURE U5 

CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACCATGGCAA 
AGAATCCTCCAGAGAAT TGTGAAGACT GTCACATTCTAAATGCAGAAGCT TT TAAATCCAAGAAA 
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
TCTACAGCAATGGAGAGAAGAAGAAGAT T TACAT GGAAAT T GATCC TG T GACCAGAACT GAAAT A 
TTCAGAAGCGGAAATGGCAC T GATGAAACAT TGGAAG T GCACGAC T T TAAAAACGGATACACTGG 
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT 
CT GAACCAGAAGAGGAAATAGATGAGAAT GAAGAAAT TACCACAACTT TC T T TGAACAGT CAGTG 
AT T TGGGT CCCAGCAGAAAAGCC T ATT GAAAACCGAGAT T T T C T TAAAAAT T CCAAAAT T CT GGA 
GATTTGTGATAACGTGACCATGTATTGGATCAATCCCACTCTAATATCAGTTTCTGAGTTACAAG 
ACT TTGAGGAGGAGGGAGAAGATC T TCAC TT TCC TGCCAACGAAAAAAAAGGGAT TGAACAAAAT 
GAACAGT GGGTGG TCCC TCAAGTGAAAGTAGAGAAGACC CGT CAC GCCAGACAAGCAAGT GAGGA 
AGAACT T CCAATAAAT GACT ATAC TGAAAAT GGAATAGAAT T T GATCCCATGCTGGATGAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
T TAT CAC C AAAAAAAAAAAAAAAAAAA 



FIGURE 116 



MAKNPPENCEDCHILNAEAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME 
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP 
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE 
LQDFEEEGEDLHFPANEKKGIEQNEQWWPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD 
ERGYCC I YCRRGNRYCRRVCE PLLGY YPYPYC YQGGRVI CRVIMPCNWWVARMLGRV 

Important features of the protein: 
Signal peptide: 

amino acids 1-40 

Transmembrane domain: 

amino acids 25-47 (type II) 

N-glycosylation sites* 

amino acids 94-91, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239- 
242 

N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 

Cytochrome c family heme-binding site signature. 

amino acids 9-14 
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FIGURE 117 



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCT GC AT CGCGGCCACC GGGATGGACATGTGGAGC ACCC AGGACCTGT ACG ACAACCC CGT CACCT 
CCGT GT TCC AGT ACG AAGGGCT CTGGAGGAGC TG CGTG AGGC AGAGT TCAGGCT TCACCG AAT GCAGGCC C 
TATT TC ACC ATCCT GGGACT TCCAGCCATGCT GC AGGC AGT GCGAGCCCT GATGAT CGTAGGC ATCGTCC T 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CC AAAGCCAAC ATG ACACTGACCTCC GGGATC AT GTTCAT T GTCTCAGGT CT TT GT GC AATTGC T G GAGT G 
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG 
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC 
TC AC AC T AAT TGGGGGTGTG AT GATGTGC ATCGC CTGC CG GGGCCT GGCACC AGAAGAAACCAACT AC AAA 
GCCGTT TC T TAT C ATGC CTC AGGCCAC AG T GT TGCCT ACAAGCCTGGAGGCT TC AAGGCC AGCACT GGCT T 
TGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTACAATCTTATC 
CTTCCAAGCACGACTATGTGTAATGCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCA 
CCCAAAAAACAAGGAGATCCCATCTAGATTTCTTCTTGCTTTTGACTCACAGCTGGAAGTTAGAAAAGCCT 
CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
GC T GAGTT ATTT AT GAAT TAGAGGCT AT AGCT C ACATT TTC AAT CCT CT ATTTCTT TT T TTAAAT ATAACT 
TTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCACATTTTGATGATTTAGACAGACTCCCCCTC 
TT CCT C CT AGTC AATAAACCC AT T GAT G ATCT AT T T CCCAGCTT ATCCCC AAG AAAACT TTT GAAAGG AAA 
GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 



: f4 AC ACTT AC TGAAGAAGAAGC AAT AAG AG AAAGAT AT TT GT AAT C TCTCC AGC CC AT GAT CTCGGTTTT CTT 



ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 
TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTC AGAAAT TGT C CCT AGATG AAT G AGAAAAT T AT TT TT TT T AAT TT AAGT CCT AAAT AT AGT T AA 
AAT AAATAATGT TT T AGT AAAAT GAT AC AC TAT CTC TGT GAAAT AGCCTC ACCCCT ACATGT GG AT AGAAG 
GAAATGAAAAAATAATTGCTTTGACATTGTCTATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCC 
AT GAAAAGCTC ACACCTGTAATCC TAGC AC TTTGGG AGGC TGAGGAGGAAGGATCACT TGAGCC C AGAAG T 
TC GAGACT AGCC TGGGC AACAT GG AGAAGC CCT GTCTCT ACAAAAT AC AGAGAGAAAAAATC AGCC AGTC A 
TGGTGGCATACACCTGTAGTCCCAGCATTCCGGGAGGCTGAGGTGGGAGGATCACTTGAGCCCAGGGAGGT 
TGGGGCTGCAGTGAGCCATGATCACACCACTGCACTCCAGCCAGGTGACATAGCGAGATCCTGTCTAAAAA 
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 



APP ID=10063714 



Page 271 of 322 



FIGURE 118 

MSTTTCQWAFLLSILGLAGCIAATGMDMWSTQDLYDMPVTSVFQYEGLWRSCVRQSSGFTECRP 
YFTILGLPAMLQAVRALMIVGIVLGAIGLLVSIFALKCIRIGSMEDSAKANMTLTSGIMFIVSGL 
CAI AGVS VFANMLVTNFM^ S TANMYT GMGGMVQ T VQTRY T FGAAL FVGWVAGGLT L I GGVMMC I A 
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY 
V 

Signal peptide: 

amino acids 1-23 



Transmembrane domains: 

amino acids 81-100, 121-141, 173-194 



m 



6! 
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FIGURE HQ 



GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAAGAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA 
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT 
CAT CACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTC TATA 
ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA 
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA 
CAGATACTCGATACCTTCCCATCGCACAACCCAAAAAAGTTATCACACCGGAAAGAAGTCACCGAGCG 



h.-i TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG 



ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
4* AATCTTAATTACAGGAACTGT GCATCAGCTAT TTATGAT T CTATAAGCT ATT TCAGCAGAATGAGATA 

* TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 
W CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC 

It! 

AC GACATAGCAT TAT GT AC AT AGAT GAGT GTAACAT T TATAT CT C ACAT AGAGACAT GCT TAT AT GGT 
|JiJ T TTAT TTAAAATGAAATGCCAGTCCATTACACTGAATAAATAGAACT CAACT ATTGCTTT T CAGGGAA. 

f% ATCAT GGATAGGGTT GAAGAAGGTT ACTATTAATTGT TTAAAAACAGCT TAGGGATT AAT GTCC TCCA 

|1| T TTATAAT GAAGATT AAAATGAAGGCTT TAAT CAGCATTGTAAAGGAAATT GAATGGCTTT CTGATAT 

GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG 
AAAATAT TT TTGT TTTTGT ATTTGAAGAAGAATGAT GCAT TTTGACAAGAAA.TCATATATGTAT GGAT 
ATATTTTAATAAGTATTTGAGTACAGACTTTGAGGTTTCATCAATATAAATAAAAGAGCAGAAAAATA 
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT 
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TAT ACAT T TATATTAATAAA.TTGTACATT TT T CTAATT 



APP ID=10063714 



Page 273 of 322 



m 

0 



FIGURE 120 



MATHALEIAGLFLGGVGMVGTVAVTVMPQWRVSAFIENNIWFENFWEGLWMNCVRQANIRMQCK 
I YDS LLALS PDLQAARGLMCAAS VMS FIAFMMAILGMKCTRCTGDNEKVKAHILLTAGIIFIITG 
MWL I PVS WVANAI I RD FYNS I VNVAQKRE LGEAL YLGWT TALVL I VGGAL FCC VFCCNEKSSSY 
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV 

Signal peptide: 

amino acids 1-17 



Transmembrane domains: 



Hi 

Q 

m amino acids 82-101, 118-145, 164-188 
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FIGURE 121 



t; 

m 

Vi 



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTGC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG 
GATTCAAAGGAGAAAAGGGGGAATGTCTGAGGGAAAGCTTTGAGGAGTCCTGGACACCCAACTAC 
AAGCAGTGTTCATGGAGTTCATTGAATTATGGCATAGATCTTGGGAAAATTGCGGAGTGTACATT 
TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA 
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC 
ATTGAAGCTATAATT T ATT TGGACCAAGGAAGCCC TGAAAT GAAT T CAACAAT TAATAT T CAT CG 
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG 
TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC 
ATTATTGAAGAACTACCAAA ATAA ATGCTTTAATTTTCATTTGCTACCTCTTTTTTTATTATGCC 
TTGGAAT GGTTCACT TAAATGACATTT TAAATAAGTTTATGTATACATCTGAATGAAAAGCAAAG 
CTAAATATGTTTACAGACCAAAGTGTGATTTCACACTGTTTTTAAATCTAGCATTATTCATTTTG 
CTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT 
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA 
AAAAAAT AT AAAAGC TACCAAT CT T TGT ACAAT T T GT AAAT GT TAAGAAT T T T T T T TAT AT CT GT 
TAAATAAAAAT TAT T TC CAACA 
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FIGURE 122 

MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREWDLYNGMCLQGPAGVPGR 
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK 
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS 
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRI I IEELPK 

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
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FIGURE 123 



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
AT T GC CT GGAAGAAT ACAT C ATGT T T T T C GATAAGAAGAAATT GT AGGAT CCAGT TTTTTTTT T A 
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATCC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG 
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCC T AAGGCAAAGCAAAAGACC TAAGGAC GACC TT TGAACAATACAAAGGATGGGT T T CAAT G 
TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT 
ATAACAGCCT TCAAAAAC T TAAGTATAAT CAATT TAAAGGGCTCAACCAGCT CACC T GGCT ATAC 
C T T GACCATAACCATAT CAGCAATAT T GACGAAAATGC T T T TAAT GGAAT ACGCAGACT CAAAGA 
GCTGAT TC T T AGT TCCAAT AGAATC TCC TAT T TTCT TAACAATACCT T CAGACCT GTGACAAAT T 
TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG 
CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA 
AGACT GCCGCAACCTGGAAC T TT T GGACCT GGGATATAACCGGAT CCGAAGT T TAGCCAGGAATG 
TC T T TGCT GGCATGATCAGAC TCAAAGAAC T T CACC TG GAG CACAAT CAAT T T TCCAAGCT CAAC 
CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT 
CATAGGACAGACCATGT CCTGGACCTGGAGCTCCT TACAAAGGCTTGAT TTATCAGGCAATGAGA 
TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT 
TCCAACAAGCTCACAT TTATTGGTCAAGAGATT TTGGAT TCTTGGATATCCCTCAATGACATCAG 
TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT 
T TAAAGGTC TAAGGGAGAAT ACAAT TATCTG TGCCAG TC CCAAAGAGC T GCAAGGAGTAAAT GT G 
ATCGATGCAGTGAAGAACTACAGCATCTGTGGCAAAAGTACTACAGAGAGGTTTGATCTGGCCAG 
GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT 
TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT 
TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT 
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA 
GGCACAGGAAAAAGAAAAGACAGTCCCTAAAGCAAATGACTCCCAGCACCCAGGAATTTTATGTA 
GAT TATAAAC CCACCAACACGGAGACCAGCGAGAT GCT GCT GAATGGGAC GGGACCC TGCACCTA 
TAACAAATCGGGC T CCAGGGAGT GTGAGGT ATGAACCAT T GT GATAAAAAGAGC TC T TAAAAGC T 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGTCAT T T TCC T C TCATACATAAT CAACCCAT TGAAAT TTAAATAC CACAAT CAATGTGAAGCT T 
GAAC TCCGGT T TAATATAATACC TATTGTATAAGACCC TT TAC TGAT T CCAT TAATGTCGCAT T T 
GT T T TAAGAT AAAAC T T C T T T CATAGGT AAAAAAAAAAA 



FIGURE 124 



if" i 



MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
Ijy^FAGMIRLKELHLEHNQFSKLNIALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN 
WLKS FKGLRENT 1 1 CASPKELQGVNVI DAVKNYS ICGKSTTERFDLARALPKPTFKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV 



||| Important features of the protein: 

|y Signal peptide: 

amino acids 1-33 



Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites* 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, . 33-38, 245-250, 332-337, 497-502, 507-512 
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FIGURE 125 

CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG 
ACCCAGAGATGGCCCCGAGGGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC 
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
AT TGAAGAGGAAGGC T T T CT AAAGCT T T GGCAAGGAGT GACACCCGCCAT T TACAGACACGT AGT 
GTAT T C TGGAGGTCGAATGGTCACATAT GAACAT C TCC GAGAGGT T GT GT T T GGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
T T AGC C AAT C C AAC T GAC C TAG T G AAG G T T C AGAT GC AAAT G GAAG GAAAAAG GAAAC T G G AAG G 
AAAACCAT T GCGAT TTCGT GG TGTACAT CAT GCAT TT GCAAAAAT CT TAGCT GAAGGAGGAATAC 
GAGGGC T T T G GGCAGGC T G GG TAC CCAAT AT AC AAAGAG CAG C AC T GG T GAAT AT GG GAGAT T TA 
ACCACT T AT GATACAGT GAAAC AC T AC T TGGT AT TGAAT ACACCAC T TGAGGACAATAT CAT GAC 
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA 
TCAAAAGCAGAATAAT GAAT CAACCACGAGATAAACAAGGAAGGGGACT TT TG TATAAAT CATCG 
ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC 
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA 
TGAGTGGAGTCAGTCCATTTTAA 
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FIGURE 126 



MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
AP YRGMVRT ALG 1 1 EEEGFLKLWQGVT PAI YRHVVYS GGRMVT YEHLREWFGKS EDEHYPLWKS 
VIGGMMAGVI GQFLANPTDLVKVQMQMEGKRKLEGKPLRFRGVHHAFAKI LAEGG IRGLWAGWP 
NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP 
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
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FIGURE 127 

CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAAT GAAGAGGACATC CT CTCCATGT TCT CAGC TAT CCGT T C TCAGCACAGCGGT GTAGACATC 
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA 
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA 
AGGAGC G GAAT G T GGAC GAT G GG C ACAT CAT T AACAT C AAT AGC AT GT C T G GC C AC CGAG TG T T A 
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT 
GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG 
AGACACAAT T CGCC T T CAAAC TCCACGACAAGGACCC TGAGAAGGCAGC TGCCACCTAT GAGCAA 
ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA 
CATCCAGATT GGAGACATCCAGATGAGGCCCACGGAGCAGGT GACC TAGTGAC TGTGGGAGC T CC 
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT 
CACGGGAT ACCAC T TCCTGT CCACACCCCGACCAGGGGC TAGAAAATT T GT T TGAGATTT TT AT A 
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC 
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG 
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG 
TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCTGGCCCAG TGGAT T TCATGGTGAT CAT TAAAAAAGAAAAAT CGCAACCAA 
AAAAAAAAAA 



FIGURE 128 



MARPGMERWRDRLAL VTGAS GG I GAAVARALVQQGLKWGCART VGN I EE LAAECKSAGYPGTLI 
PYRCDLSNEEDI L SMFSAIRSQHSGVD I C INNAGLARPDT LLS GS T S GWKDMFNVNVLALS I CTR 
EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC 
ISPGWETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT 

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehyrogenase . 

amino acids 30-42, 104-114 
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FIGURE 12Q 



lb:::;: 

a 



AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT 
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
TTAATTTTTAACTGATAGTTGTACATATTTGGGGGTACATGTGATATTTGGATACATGTATACAA 
TATATAATGATCAAATCAGGGTAACTGGGATATCCATCACATCAAACATTTATTTTTTATTCTTT 
TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGCAAC 
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT 
GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG 
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA 
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT 
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT 
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC 
AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG 
AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG 
T T CCAT CCAT GT T GCTGCAAATGACAGGAT T TCGT TCT TAAT T TCAAT T AAAATAACCACACATG 
GCAAAAA 
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FIGURE ISO 

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGE S PMP FKRVFCQDGNVRS FCVCAVHFS S HQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

pa 

amino acids 86-92 

Q 

Zinc carboxypeptidases , zinc-binding region 2 signature. 

amino acids 68-79 



III 



o 



U 
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FIGURE 131 



TT CTGAAGTAACGG AAGCT ACC TT GT AT AAAG AC CT CAAC AC TGCTGACC ATGATC AGCGCAGCCT GGAGC 
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAATACCAGAGGATGCTACAACTCTCTACCTTCAGAACAACCAAATAAATAATGCTGGGATTCCTTCAGAT 
TT GAAAAACT T GCT GAAAGT AG AAAG AAT AT ACCT ATACC AC AACAGT TT AGAT GAAT TT CCT ACC AACCT 
CCCAAAGT AT GT AAAAGAGT T AC AT T TG CAAG AAAATAAC AT AAGGAC T ATC ACTT AT GATTCACT TTCAA 
AAATTCCCT AT CTGGAAGAATT AC AT TT AGAT GAC AACTC TGT C TCTGC AGT TAGC AT AGAAGAGGGAGC A 
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT 
GCCC AG GACT AT AGAAGAACTACGCT TGGATG AT AAT C GC AT AT CCAC TAT T TCAT CACC AT CT CTT C AAG 
GTCTCACTAGTCTAAAACGCCTGGTTCTAGATGGAAACCTGTTGAACAATCATGGTTTAGGTGACAAAGTT 
TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT 
TCCAGGCACAAACC TGAGGAAGCT TT AT CTT C AAGAT AAC CACAT C AATCGGGT GCCCCC AAATGCTTT T T 
CT T ATCT AAGGC AGCT CT AT CG ACTGGAT AT GTCCAAT AATAACCT AAGT AAT T TACCTC AGGGTATCT T T 
GAT GAT T T GG AC AATATAAC AC AACT GATTC T TC GC AAC AATCCCT GGT ATT GC GGGTGC AAGAT GAAAT G 
GGT ACGT G AC TGGT T ACAAT C ACT ACCT GTGAAGGT C AACGTGCGT GGGCTC AT GT GCCAAGCC CC AG AAA 
AGGT TC GT GGGATGGC TATT AAGG AT CTC AAT GC AGAACTGTTT GAT TGT AAGGAC AGTGGGAT T GTAAGC 
ACCATTCAGATAACCACTGCAATACCCAACACAGTGTATCCTGCCCAAGGACAGTGGCCAGCTCCAGTGAC 
CAAACAGCCAGATATT AAGAACCCCAAGCTCACTAAGGAT CAACAAACCACAGGGAGTCCCT CAAGAAAAA 
m CAATTACAATTACTGTGAAGTCTGTCACCTCTGATACCATTCATATCTCTTGGAAACTTGCTCTACCTATG 
g| AC TGCT TT GAGACT C AGCTGGCT T AAACTGGGCC AT AGCCCGGCATT T GGAT CT AT AACAGAAAC AATT GT 

f|| AACAGGGGAACGCAGTGAGTACTTGGTCACAGCCCTGGAGCCTGATTCACCCTATAAAGTATGCATGGTTC 
CCATGGAAACCAGCAACCTCTACCTATTTGATGAAACTCCTGTTTGTATTGAGACTGAAACTGCACCCCTT 
CGAATGTACAACCCTACAACCACCCTCAATCGAGAGCAAGAGAAAGAACCTTACAAAAACCCCAATTTACC 
TTTGGCTGCCATCATTGGTGGGGCTGTGGCCCTGGTTACCATTGCCCTTCTTGCTTTAGTGTGTTGGTATG 
TTCATAGGAATGGATCGCTCTTCTCAAGGAACTGTGCATATAGCAAAGGGAGGAGAAGAAAGGATGACTAT 
GC AGAAGCTGGC ACT AAGAAGGAC AACTCTAT CCT GGAAATC AGGGAAACTTCT TT TC AGAT GT T ACC AAT 
AAGCAATGAACCCATCTCGAAGGAGGAGTTTGTAATACACACCATATTTCCTCCTAATGGAATGAATCTGT 
ACAAAAACAATCACAGTGAAAGCAGTAGTAACCGAAGCTACAGAGACAGTGGTATTCCAGACTCAGATCAC 
TC AC AC TCATGATGCT GAAGGACT C AC AGC AG ACT TGT GT TTT GGGTT T T TT AAACCT AAGGGAGGTG AT G 
GT 
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FIGURE 132 

MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL 
QNNQINNAG I PS DLKNLLKVERI YLYHNS LDE FPTNLPKYVKELHLQENNIRT I T YDS LSKI PYL 
EELHLDDNSVSAVS IEEGAFRDSNYLRLLFLSRNHLS T I PWGLPRT IEELRLDDNRI S T I S S PSL 
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN 
RVPPNAFS YLRQL YRLDMSNNNLSNL PQG I FDDLDNI T QL I LRNNPWYCGCKMKWVRDWLQSLPV 
KVNVRGLMCQAPEKVRGMAI KDLNAEL FDCKDSGI VS T I QI T TAI PNT VYPAQGQWPAPVTKQPD 
IKNPKLTKDQQTTGSPSRKTITITVKSVTSDTIHISWKIALPMTALRLSWLKLGHSPAFGSITET 
I VTGERSE YLVTALEPDSPYKVCMVPME TSNLYLFDE TPVC I E TE TAPLRMYNPTT TLNREQEKE 
PYKNPNLPLAAI I GGAVALVT I AL LAL VC W YVHRNG S L F S RNC AYS KGRRRKDD YAE1AG T KKDNS 
ILE IRETS FQMLP I SNEPI SKEEFVI HT I FPPNGMNLYKNNHSES SSNRS YRDSGI PDSDHSHS 

Important features of the protein: 
Signal peptide: 
amino acids 1-28 
Transmembrane domain: 
amino acids 531-552 
N-glycosylation sites. 

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636 
Tyrosine kinase phosphorylation site, 
amino acids 515-522 
N-myristoylation sites. 

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561, 
640-645 

Amidation site. 

amino acids 567-570 
Leucine zipper pattern. 

amino acids 159-180 

Phospholipase A2 aspartic acid active site. 

amino acids 34-44 
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FIGURE 133 



CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG 
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC 
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG 
GACCAC TCCAACAGAGCCAGGACTATAT CAACCT C T TCTGCGCCAACAT GAT GGAC T TGAACCGC 
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG 



fjf TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA 
3 AACAGCAGGAAGGATGCT T CGGGGAGC CTGATGC TGAAGAT GAAGAAT TATCTAAAGCTAT T CAA 



TATCAGCAGCAT T T T T C GAGGAGAGTGAAGAGGCGAGAAAAACAAT T T CCAGAT T CT CGCTC T GT 
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC 
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT 
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA 
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FIGURE 124 

MSARGRWEGGGRRACRGS LGLARAQGAERVT S SEQRPAMAS LGLLLLLLLTALPPLWS S S LPGLD 
TAESKAT I ADL ILSALERAT VFLEQRLPE I NLDGMVGVRVLEEQLKS VREKWAQE PLLQPLS LRV 
GMLGEKLEAAI QRSLHYLKLSDPKYLRE FQLTLQPGFWKLPHAW IHTDASLVYPT FGPQDS FSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCAI^DLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 



Important features of the protein: 
Signal peptide: 



3.3 amino acids 1-2 6 



Transmembrane domain : 

amino acids 39-56 

s 

*-k Tyrosine kinase phosphorylation sites. 



amino acids 149-156, 274-282 
N-myristoylation sites. 

amino acids 10-16, 20-26, 63-69, 208-214 



Amidation site. 

amino acids 10-14 

Glycoprotein hormones beta chain signature 1. 

amino acids 230-237 
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FIGURE 135 



.SSI; 
CI 



5 ; ::r 



GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG 
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCT.GCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC- 
AGCTTACAGATT TGAT CCCGTTCGAGTGGATATCACTTCGAAAGGAAAAATGAGAGCAAGATATG 
TGAATTACAT CAAAAC AT CAGAGGTTGT CAGACTGCCCTAT CCT CT CCAAAT GAAATCT T CAGGT 
CCACCT TCTTACTTTAT TAAAAGGGAAT CGTGGGGCTGGACAGACT TTCTAATGAACCCAATGGT 
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC 
C TGACATGAGACGGGAAATGGAGCAGT CAATGAATAT GCT GAAT TCCAACCAT GAGTTGCCTGAT 
GT T TC T GAGT TCATGACAAGACTC T TC T CT TCAAAAT CATC T GGCAAAT CTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGT GGGGC TGGCAAAAGGAGG TAGT CAGGCCGTCCAGAGC TGGCAT TTGCAC 
AAACACGGCAACAC TGGGTGGCATCCAAGT CT TGGAAAACCGTGT GAAGCAAC TACTATAAAC T T 
GAGTCAT CCCGACGT TGAT CT CT TACAACT GT GTATGT T 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAGGT CAAT AT TGATGT CACTGAAT TAAT TACAGTGTCCTATAGAAAAT GCCAT TAATAAAT TAT 
ATGAAC TAC TATAC AT TATGTATAT TAAT TAAAACATC T TAATCCAGAAATCAAAAAAAAAAAAA 
AAAAAAAAAAAAAAA 
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FIGURE 136 



MAAALWG FFPVLLLLLLS GDVQS S E VPGAAAEGS GGS GVGI GDRFKI EGRAWPGVKPQDW I SAA. 
RVLVDGEEHVGFLKTDGSFWHDIPSGSYWEWSPAYRFDPVRVDITSKGKMRARYVNYIKTSE 
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFLl^PMVMMW 
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR 



Important features of the protein: 
Signal sequence: 

amino acids 1-23 



?£f Transmembrane domain: 

amino acids 161-182 

§4 N-glycosylation site. 

amino acids 184-187 



■ sii- Glycosaminoglycan attachment sites. 

Ft amino acids 37-40, 236-239 

m 

^ cAMP- and cGMP-dependent protein kinase phosphorylation site. 

Ill 

ammo acids 151-154 
N-myri s toyl a ti on s i tes . 

amino acids 33-38, 36-41, 38-44, 229-234 

Amidation site. 

amino acids 238-241 

ATP/GTP-binding site motif A (P-loop) . 

amino acids 229-236 



APP ID= 
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FIGURE 137 



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGG 
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
jU AAGCT G TGT GAT CGCCACAAACC T TCAGGAAATACGAAAT GGAT T T TCT GAGATACGGGGCAGTG 

.0 TGCAAGCCAAAGATGGAAACAT T GACATCAGAATC T TAAGGAGGAC TGAGT CT TTGCAAGACACA 

JJJ AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
y J AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
M CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
jhV AAGAAATACAGCCAGAT TCTGAGTCAC T T T GAAAAGCT GGAACCTCAGGCAGCAGT TGT GAAGGC 

4? TTTGGGGGAACTAGACATTCTTCTGCAATGGATGGAGGAGACAGAATAGGAGGAAAGTGATGCTG 
C TGCTAAGAATAT TCGAGG TCAAGAGC T CCAGTC T TCAAT ACCTGCAGAGGAGGCAT GACCC CAA 
>sf ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 
ft! T GTAAT AT CT T TC TGCTAT TGGATATAT T TAT TAGTTAATATAT T TAT T TAT T TT TTGC TAT T TA 

13 ATGTAT TT AT T T T T T TACT TGGACATGAAACT TTAAAAAAAT TCACAGAT TATAT T TAT AACC TG 

mi 

* ACTAGAGCAGGTGATGTATTT TTATACAGTAAAAAAAAAAAACCT TGTAAAT TCTAGAAGAGTGG 

C TAGGGGGGT TAT TCAT TTGTAT TCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
ATTTGAAAT T GAACCAATGAC TAC T TAGGAT GGGT TGTGGAATAAG TT T TGATGT GGAAT TGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG 
CCAGGAAT CCTACACGGCCAGCATGTAT T T CTACAAAT AAAGT T T TCT T TGCATACCAAAAAAAA 
AAAAAAAAAAA 
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FIGURE 138 



MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
S SREAWQFFLLLWS PD FRPKMKAS S LAFS LLSAAF YLLWT PS TGLKTLNLGS CVI ATNLQE I RNG 
FS E I RGS VQAKDGNI D I RI LRRTE S LQDTKPANRCCLLRHLLRL YLDRVFKNYQT PDH YTLRKI S 
S LANS FLT I KKDLRLSHAHMTCHCGEEAMKKYS Q I LSHFEKLE PQAAWKALGELD I LLQWMEE T 
E 

Important features of the protein: 
Signal peptide: 

amino acids 1-42 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 192-195, 225-228 

N-myristoylation sites. 

amino acids 42-47, 46-51, 136-141 
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FIGURE 12Q 



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
TTTGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCCTCGAATCAAGGCATTGATGACGGGGAG 
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA 
TAAGACAAGC AAAAGCAGCT GGAAAAAGAATAGTC TT T TAT GGAGATGAAAC C TGGGTT AAAT T A 
T TCCCAAAGCAT T T TGT GGAATAT GAT GGAACAAC CT CAT T T TT CGTGT CAGAT TACACAGAGGT 
GGATAATAATGTCACGAGGCATTTGGATAAAGTATTAAAAAGAGGAGAT TGGGACATAT TAAT CC 
M TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG 
B AAGCTGAGCGAGATGGACAGCGTGCTGATGAAGATCCACACCTCACTGCAGTCGAAGGAGAGAGA 
P GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
I'sl GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
fl CCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC 
W AC TTGGCT TACCGAT T C CAAAAGACAGT GTAGGGAGCC T CC TAT T CCCAG T T GTGGAAGGAAGAC 

%'J CAAT GAGAGAGC AG T T GAGAT T T T T AC AT T T GAATAC AG T GCAGC T TAG T AAAC T G T T G CAAGAG 

y ;! AATGTGCCGTCATATGAAAAAGATCCTGGGTTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGG 
j» GAACTGGAT CAGAC TG T ACT TGGAGGAAAAGCAT T CAGAAGTCCTATT CAACCTGGGCTCCAAGG 

TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG 
f M TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA 
CI CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT 
Iff CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT 
pi GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC 
AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG 
TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC 
C!l CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA 
f|l T C G GACAGC C T CC C AGCAGAG GT G T GG GAGC TGCAGC T GAG GGAAGAAGAGAC AATC GG C CT G GA 

CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT 
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC 
T CGGACACCT TCAT TCGT TAGC TGGGGAGT GGTGGTGAGGCAGT GAAGAAGAGGCGGAT GGT CAC 
AC T CAGAT CCACAGAGCCCAGGATCAAGGGACCCAC T GCAG T GGCAGCAGGAC TG T TGGGCCCCC 
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT 
CTGACCGAGACAC T CACAGCT T TGTCATCAGGGCACAGGCT T CCT CGGAGCCAGGAT GAT CTGT G 
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG 
CTGCACACAGTATGTAGTTACCAAAAGAATAAACGGCAATAATTGAGAAAAAAAA 
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FIGURE 1AQ 



MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKWIVLIDALRDDFVFGSKGWFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF 
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMS ETGSHGASS TEEVNT PL I L I S SAFERKPGD IRHPKHVQ 

Important features of the protein: 
Signal peptide: 

amino acids 1-34 

Transmembrane domain: 

amino acids 58-7 6 

N-glycosylation sites. 

amino acids 56-60, 194-198 

N-myristoylation sites. 

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276, 
275-281, 278-284 

Amidation site. 

amino acids 154-158 

Cell attachment sequence. 

amino acids 205-208 



APP 



n>= 



10063714 



Page 294 



FIGURE 141 



GGCACGAGGCAAGCC T T CCAGGT TATCGT GACGCACC T TGAAAG TC TGAGAGC TAC TGCCC TACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
T CCCTATAGAAAACAACTGCCAGCACCT TAAGACCACT CACACCTTCAGAG T GAAGAACT TAAAC 
CCGAAGAAATTCAGCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTCAT 
AGCAGTT CCAGATAAAAACTACATACGCCCAGAGATC T T CT TTGCAT TAGCCTCAT CC T T GAGC T 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGTGACAAGGATAAAGGACAAAGTCATCCATCCC T TCAGCTGAAGAAGGAGAAACTGATGAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GTTGGGGTGACAGATAAATT TGAGAACAGGAAACACAT T GAAT TT TCAT TTCAACCAGT TT GCAA 
AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA 
AT TTGAACTAAT TGTATAAAAACACCAAACCTGCTCACT 
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FIGURE 142 

MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 
FFALASSLSSASAEKGSPILLGVSKGE FCLYCDKDKGQSHPSLQLKKEKLMKI1AA.QKESARRPFI 
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFS FQPVCKAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site* 

amino acids 33-36 

N-myristoylation site* 

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182 
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FIGURE 143 

C T AGAGAGT ATAGGGCAGAAGGAT GGCAGAT GAGT GACTCCACAT CCAGAGCTGCC TCCC TT TAA 
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA 
CAGTGGAGTTTGCTGTCCACACATTCAACCAACAGAGCAAGGACTACTATGCCTACAGACTGGGG 
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT 
lZ GGGGAGAAC TAGGTGTGGGAAAT T TGAAGACGACATT GACAACTGCCAT TT CCAAGAAAGCACAG 

P| AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC 
m CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT 
W GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT 
y$ TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC 
p ACACAT C TT GAGC C TAATCATGTAG T GTAGATCAT TAAACAT CAGCAT T TTAAGAAAAAAAAAAA 

SSfSil 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^ 

h 
m 

m 
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FIGURE 144 



MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNNTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 . 

N-glycosylation sites. 

amino acids 117-121, 139-143 

N-myristoylation site. 

amino acids 9-15 



APP_ID= 
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FIGURE 145 



C TGTGCAGC T CGAGGCT CCAGAGGCACACT CCAGAGAGAGCCAAGGT T C T GAC GC GATGAG GAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA 
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT 
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT 
AAGCT T GCCAGGAGGC TGGC AGTACAGAGCGCAGCAGCGAGCAAATCC T GGCAAGTGACCCAGCT 
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA 
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT 
CCGTCACTCCAT TCTCAGCC CCTAGCAGAGCGTC T GGCACACTAGAT TAGTAGTAAATGC T TGAT 
GAGAAGAACACATCAGGCACTGCGGCACCTGCTTCACAGTACTTCCCAACAACTCTTAGAGGTAG 
GTGTAT TCCCGTT TTACAGATAAGGAAACTGAGGCCCAGAGAGCTGAAGTACTGCACCCAGCATC 
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT 
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT 
C T C T AAT GAAAT T G T GAAAG C T C CATG T T T AGAAAT AAAT GAAAACAC C T GA 
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FIGURE 146 

MRKHLSWWWLATVCMLLFSHLSAVQTRGIBCHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIW3.MVK 



Important features of the protein: 
Signal peptide: 

amino acids 1-26 



CI Transmembrane domain: 

y amino acids 157-171 

M N-glycosylation sites. 

1-4 amino acids 98-102, 110-114 

^ Tyrosine kinase phosphorylation site. 

JsJ- amino acids 76-83 



CM 



N-myristoylation sites. 

amino acids 71-77, 88-94, 93-99, 107-113, 154-160 



Amidation site. 

amino acids 62-66 
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FIGURE 147 



^1 

fas Is 

47 



GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CC T GAT GATT TATAGAC TCAAAGAAAAC TC ATG T TCAGAAGC TC T CT T C T CT TCT GGCCT CCT CT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACTTTCAGATTCAGGGGGTACATGTGAAGGTTTGTTTTATGAGTATATTGCATGATGCTGAGG 
TTTGGGGT 
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FIGURE 1A8 



MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68 
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FIGURE 1J.Q 

GTCTCCGCGTCACAGGAA.CTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC 
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCGCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCT ACAGCT CCAGGAGCCCAGC GCCGGGCTGTGACCCAAGC CGAG CGTGGAAGAATGGGGT T 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAATCTCTACATAATAGAGAATTAAGTGCAGAAAGACCTTTGAATGAACAGATTGCTGAA 
GCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAAACAAGCCAGGT CAGAGCAACTATT CTT T 
TGTTGATAACTTGAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATTGAGAAAGAAAGACAATCTA 
N TAAGAAGCTCCCCACTTGATAATAAGT TGAAT GTGGAAGAT GT TGAT TCAACCAAGAATC GAAAACT G 

W ATCGATGATTATGACTCTACTAAGAGTGGATT GGATCATAAAT TT CAAGAT GATCCAGAT GGTCTTCA 

T CAACTAGACGGGACTCCT TTAACCGCTGAAGACATTGTCCATAAAATCGC TGCCAGGAT TTAT GAAG 
J. I AAAAT GACAGAGCCGT GTT TGACAAGATTGTT TCTAAACT ACT TAAT CTCGGCCTTATCACAGAAAGC 

%l CAAGCAC^TACACTGGAAGATGAAGTAGCAGAGGTTTTACAAAAATTAATCTCAAAGGAAGCCAACAA 
SU TTATGAGGAGGATCCCAATAAGCCCACAAGCTGGACTGAGAATCAGGCTGGAAAAATACCAGAGAAAG 
Jl TGACT CCAATGGCAGCAAT TCAAGATGGT CTT GCTAAGGGAGAAAAC GATGAAACAGTAT CTAACACA 

^' TTAACCTTGACAAAT GGCT TGGAAAGGAGAACTAAAACCT ACAGT GAAGACAACTT TGAGGAACTCCA 

ATATT TCCCAAATTT CTAT GCGCTACTGAAAAGTATTGAT T CAGAAAAAGAAGCAAAAGAGAAAGAAA 
CACT GATTACTATCAT GAAAACACT GATT GACTT TGTGAAGAT GATGGT GAAATAT GGAACAATATCT 
Si CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT 
P! AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA 
p| CAGACAGTACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATATGGAAGCTTGAAGGATTCCACAAAA 
GATGATAACTC CAAC CCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTAT T TGGAAGC 
CATCAGAAAAAATAT TGAATGGTT GAAGAAACATGACAAAAAGGGAAATAAAGAAGATTAT GACCTTT 
CAAAGAT GAGAGACTTCATCAATAAACAAGCTGATGCTTAT GT GGAGAAAGGCATCCTTGACAAGGAA 
GAAGCCGAGGCCATCAAGCGCATT T ATAGCAGCCTGTAAAAAT GGCAAAAGAT CCAGGAGT CTT TCAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGG 
GTTATTAGAAAGTGCTGAATT TACAGTAGTTAACCTT TTACAAGTGGT TAAAACATAGCTT TCT TCCC 
GTAAAAACTATCTGAAAGTAAAGTT GTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA 
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FIGURE lso 



M 



MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG 
QSNYS FVDNLNLLKAI TEKEKI EKERQS IRS S PLDNKLNVEDVDS TKNRKL I DDYDS TKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKL I S KEANNYEEDPNKPT S WTENQAGKI PEKVT PMAAI QDGLAKGENDETVSNTLT LTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKSIDSEKEAKEKETLITIMKTLIDFVKMMVKYGTISPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD 
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 

Casein kinase II phosphorylation site: 

amino acids 70-73, .82-85, 97-100, 125-128, 147-150, 188-191, 217- 

220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365, 368- 
341, 369-372, 382-385, 386-389, 387-390 



N-myristoylation sites: 

m amino acids 143-148, 239-244 



APP_ID=10063714 



Page 304 of 322 



FIGURE 151 



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGCTTCTAGCTGGAGGGCTGCATGGAGGGAAGGTCATTAAAGGTGAAGAGATCAGC 
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAAT CAC TCTCTCTGCTCT CAGGACC CCCACGT CTGACT TAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA 
CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT 
TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA 
GGGTCAGTAGCTCTCCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA 
GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA 
TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA 
GAAATGGCTCGAGCTCAGAAGATAAAAGATAAGTAGGGTATGCTGATCCTCTTTTAAAAACCCAA 
GATACAATCAAAATCCCAGAT GC TGGTC TC TAT TCCCATGAAAAAGTGCTCATGAGATAT T GAGA 
AGACCTACTTACAAAGTGGCATATATTGCAAT TTATTT TAATTAAAAGATACCTAT TTATATATT 
TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTGAATTGTAGCAATGTCAGGGTGGTGGCAGTAT 
AGGTGATT TT TC T T T TAAT TC TGT TAAT TT AT CT GTAT T T CCTAAT T T T TCTACAAT GAAGAT GA 
AT T CCT T GTATAAAAATAAGAAAAGAAAT T AATC T TGAGGTAAGCAGAGCAGACATCATCTCTGA 
TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG 
TTGTAGTAGTGATCAGGAAACAGATCTCAGCAAAGCCACTGAGGAGGAGGCTGTGCTGAGTTTGT 
GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG 
GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA 
ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
GTCTCTGCAGATGTAGT TAGT TAAGACAAGGTCATGCTGGAT GAAGGTAGACC TAAATTCAATAT 
GAC TGGT T T CCT TGTAT GAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTAT GTA 
AAGATGAAGGCAGAGAT C GGAGT T T TGCAGCCACAAGCTAAGAAACACCAAGGAT T GTGGCAACC 
ATCAGAAGCT TGGAAGAGGCAAAGAAGAAT TC TTCCCTAGAGGCT TTAGAGGGATAACGGCTCTG 
C TGAAACCT TAATC TCAGACT TC CAGCCTCCTGAACGAAGAAAGAATAAAT T TCGGC TGT TT TAA 
GCC AC CAAGGATAAT T GGT TACAGCAGC TC TAGGAAAC T AAT ACAGCT GCTAAAAT GAT CCCTGT 
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGT GAT GGCATGCCAC T T CCAAGAT T AGGTTATAAAAGACAC TGCAGCTT C 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GC CCACAGCCACAT TAGTGAACC TAGAAGCAGAGAC T C T GT GAGATAAT CGATGT T TGT T GT T TT 
AAGTTGCTCAGTTT TGGTCTAACT TGT TATGCAGCAATAGATAAATAATATGCAGAGAAAGAG 
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FIGURE 152 



MVXSGALCFRMKDSALKVLYLHNNQLLAGGLHAGKVIKGEE I SWPNRWLDASLSPVILGVQGGS 
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAP I TD FY FQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature. 

amino acids 111-131 

Interleukin-1 proteins. 

amino acids 8-29, 83-120, 95-134, 64-103 
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FIGURE isa 



CTTCAGAACAGGTTCTCCTTCCCCAGTCACCAGTTGCTCGAGTTAGAATTGTCTGCA ATG GCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TAT TGAAGGTGATGAC CT GCATATCCAGAGGAAT G TGCAAAAGCTGAAGGACACAGT GAAAAAGC 
t»* T TGGAGAGAGTGGAGAGATCAAAGCAAT T GGAGAAC T GGATT TGC TGT T TAT GTCTC TGAGAAAT 

J;;! GCCTGCATTTGACCAGAGCAAAGCTGAAAAATGAATAACTAACCCCCTTTCCCTGCTAGAAATAA 
S| CAATTAGATGCCCCAAAGCGATTTTTTTTAACCAAAAGGAAGATGGGAAGCCAAACTCCATCATG 
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA 
S| AGACCAGAAGGTAGACTTTCTAAGCATAGATATT TAT TGATAACAT TT CATTGTAAC TGGTGTTC 

^ ;;;fs . TATACACAGAAAACAAT T TAT TT T T TAAATAAT TGTCTTTTT CCATAAAAAAGAT TACT T TCCAT 

P 

v TCC TT TAGGGGAAAAAACCCCTAAATAGCT T CAT GT T T CCAT AAT CAGTACT T TATATT TATAAA 

p| T GT AT T TAT TAT TAT T AT AAGAC T GC AT T T TAT T TAT AT CAT T T TAT T AAT AT GGAT TT AT T TAT 

Ill AGAAACATCATTCGATAT T GC TAC T T GAGTGTAAGGC TAATAT T GAT AT T TAT GACAATAAT TAT 

13 AGAGC TATAACATGTTTATT TGACCTCAATAAACACT TGGATATCCC 
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FIGURE 154 

MAALQKS VS S FLMGTLAT S CLLLLALLVQGGAAAP I S SHCRLDKSNFQQP Y I TNRT BMLAKEAS L 
ADNNTDVRLIGEKLFHGVSMSERCYmKQVLNFTLEEVLFPQSDRFQPYMQEWPFIARLSNRLS 
TCHIEGDDLHIQRNVQKLKDTYKKLGESGEIKAIGELDLLFMSLRNACI 



Important features of the protein: 
Signal peptide: 

amino acids 1-33 

^* N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 

|lj N-rayristoylation sites. 



m 



amino acids 14-20, 82-88 



Prokaryotic membrane lipoprotein lipid attachment site. 



amino acids 10-21 

IS* 
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FIGURE 155 



GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC 
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG 
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT 
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT 
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA 
5| GC T GCT CTACCACAACCAGACTG TCT T CTACAGGCGGCCAT GCCATGGC GAGAAGGGCAC CCACA 

hj AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT 
SI GTGATGGGCTAGCCGGACCTGCTGGAGGCtGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA 
\l ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG 
^ CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG 

2: 

f*& CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA 
Iff GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG 
GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT 
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT 
TTGACATGGATGATTCTGAGGAGGAAGCTGT TATTGAAT GTATAGAGAT TTATCCAAATAAATAT 
CTTTATTTAAAAATGAAAAA 
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FIGURE 156 

MRERPRLGEDSSLISLFLQWAFIJ^MVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA 
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites. 

amino acids 44-50, 150-156 
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FIGURE 157 



CCGGCGATGTCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACT T GAGGGACCT CCGAGTAGAACC TGT TACAAC T AGTGT T GCAACAGGGGAC TAT TCA 
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA 
GATTTGTGTGACGGGCAAAAGCAACTTCCAGTCCTACAGCTGTGTGAGGTGCAATTACACAGAGG 
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCTGAACACAG TCTATT T CAT T GGGGCCCATAATAT T CCT AAT GCAAATATGAATGAAGAT GG 
C CC TT CCAT GTC T GTGAAT T T CACCT CACCAGGCT GC CT AGACCACATAATGAAATATAAAAAAA 
if AGTGT GT CAAGGCCGGAAGCCT GTGGGAT CCGAACAT CAC T GCT TGTAAGAAGAAT GAGGAGACA 

| GTAGAAGTGAACTTCACAACCACTCCCCTGGGAAACAGATACATGGCTCTTATCCAACACAGCAC 
m TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACGAGAAGAAACAAACGCGAGCTTCAGTGGTGA 
III TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC 
~* AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT 
%~ GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA 
CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT 
O TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA 
III TCACACAATTTGTTACTTCACTGAATTTCTTCAAAACCATTGCAGAAGTGAGGTCATCCTTGAAA 
AGTGGC AGAAAAAGAAAATAGCAGAGAT GGG TCCAGTGCAG T GGC T TGC CACT CAAAAGAAGGCA 

yj 

J* GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA 

w 

fijf GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
GTGATCTAAGAAGCCAGATTCATCTGCACAAATACGTGGTGGTCTACTTTAGAGAGATTGATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG 



APPJD=10063714 



Page 311 of 322 



FIGURE 1*8 



MSLVLLSLIU^CRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 

LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 

NANMNEDGPSMSVNFTSPGCLDHIMKYKKKCVKAGSLWDPNITACKKNEETVEVNFTTTPLGNRYMAL 

STIIGFSQVFEPHQKKQTRASWIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTWLCPQTGVPFPLDNNK 

SKPGGWLPLLLLSLLVATWLVAGIYJMRHERIKKTSFSTTTLLPPIKVLVVYPSEICFHHTICYFTEFL 

QNHCRSEVILEKWQKKKIAEMGPVQWLATQKBCAADKVVFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA 

FNLFCSDLRSQIHLHKYVWYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD 

GCCSL 

Important features of the protein: 
Signal peptide : 

amino acids 1-14 

Transmembrane domain: 

amino acids 290-30 9 

N- glycosylate on sites. 

amino acids 67 - 71, 103 -107, 156 - 160, 183 - 187, 197 - 201 and 283 

- 287 

cAMP- and cGMP-dependent protein kinase phosphorylation sites. 

amino acids 228 - 232 and 319 - 323 

Casein kinase II phosphorylation sites. 

amino acids 178 - 182, 402 - 406, 414 - 418 and 453 - 457 

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 
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FIGURE- isq 



AGCCACCAGCGCAACATGACAGTGAAGACCCTGCATGGCCGAGCCATGGTCAAGTACTTGCTGCT 
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA 
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC 
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG 
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA 
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA 
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT 
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAG TAAG AGGTGCATATCC 
H ACTCAGCTGAAGAAG 

Li 

m 

M 

k, E 



y 



?§'! 
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FIGURE 160 



MTWTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEWQAQCRNLGCINAQGKEDISMNSVPIQQETLW 
RRKHQGCS VS FQLEKVLVTVGC T C VT PVI HHVQ 

Signal sequence: 

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 

N-myristoylation sites. 

amino acids 106-111, 136-141 
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FIGURE 161 



ACACTGGCCAAACAAAAACGAAAGCACTCCGTGCTGGAAGTAGGAGGAGAGTCAGGACTCCCAGG 
ACAGAGAGTGCACAAACTACCCAGCAGAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAGATGCCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGCTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCT GAAGATGAGGAAAAGT T TGGAGGAGCAGCTGACTCAGGGGT GGAGGAGCC T AGGAA 
TGCCTCTCTCCAGGCCCAAGTCGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA 
j^l GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
!!f ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG 
W GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA 
ffl CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA 
III CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG 
Q CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT 
B "f GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG 
TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG 
4? AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA 
* CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA 
GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA 
CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG 
W'f CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG 
0 CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA 
PI CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA 
m CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG 
sigl CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGGCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACTTAAATAAAGGCAGACGCTGTTTTTCTAAAAAAA 
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FIGURE 162 



MPVPWFLLSLMjGRSPWLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV 
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQWLSFQAYPTARCVLLEV 
QVPAALVQFGQSVGSVVYIX:FEAALGSEVRIWSYTQPRYEKELNH^^ 

EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN 
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ 
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS 
GQCLQLWDDDLGALWACPMDKY IHKRWALVVfLACLL FAAALSL ILLLKKDHAKGWLRLLKQDVRSGAAARG 
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGWVLLFSP 
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 

Signal sequence: 

amino acids 1-20 

Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 

N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705 
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FIGURE 163 



GGGAGGGCTCTGTGCCAGCCCCG ATGA GGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT 
GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA 
AAACATCCTGACGTGGGACAGCGGGCGAGAGGGCACCCCAGACACGGTCTACAGCATCGAGTATA 
AGACGTACGGAGAGAGGGAC TGGGTGGCAAAGAAGGGC TGT C AGCGGAT CACCCGGAAGT CCT GC 
AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 
GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC 
CC TCAAGCCACC TGAT GT GACCTGTAT CT CCAAAX3TGAGAT CGAT TCAGAT GAT T GT T CATCC TA 
CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 
TT CTACCACT TAGAGCT CCAGGT CAACCGCACC TACGAAAT GCACCT T GGAGGGAAGCAGAGAGA 
ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 
CC TGGGCCAAGGAGAGT GCCCCCTACAT GT GCCGAGT GAAGACACTGC CAGACCGGACATGGACC 
, , TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 
CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 
CJ TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 
r| AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 
fi| TCCACAGCGGCATAGCCTGT CCGAGATCACCTAC T TAGGGCAGCCAGACATCTCCATCCTCCAGC 

CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 
jf f GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 
^1 GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 
H CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 
> CACCTT AGGCCTAAAGGTCAGCT T CAGAAAGAGCCACCAGC T GGAAGC T GCATG T TAGGTGGCC T 

T TC T C T GCAGGAGGT GACCT CC T TGGCTAT GGAGGAAT C CCAAGAAGCAAAATCAT TGCACCAGC 
Z.^ CCC TGGGGAT T TGCACAGACAGAACAT C TGACCCAAATG TGCT ACACAGT GGGGAGGAAGGGACA 

CI CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 
Of CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 
f;| TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 
J» GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 
Jf AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 
CI ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 
f|| AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 
AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 
CAAGGCAGAAAT GACAG T GCAAGGAGGAAAT GCAGGGAAACT CCCGAGG T CCAGAGC CCCACCT C 
CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 
ACAATCTAGC TCGACAGAGCATGAGGCCCCT GCCTCTT CT GT CAT TGT T CAAAGGTGGGAAGAGA 
GCC TGGAAAAGAACCAGGCC TGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCT GC 
ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 
TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 
GAAATGCAGGTCCACCAGGGAGGGAGACACACAAGCCTTTTCTGCAGGCAGGAGTTTCAGACCCT 
ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 
TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 
CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 
ATCTGTAATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 
TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA 
AAAAAAAA 
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FIGURE i6a 

MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS 
KVRS I QMIVHPT PT P I RAGDGHRL TLEDI FHDL FYHLE LQVNRT YQMHLGGKQRE YE FFGLTPDT 
E FLGT IMI CVPTWAKE SAPYMCRVKTLPDRTWT YS FSGAFL FSMG FLVAVLCYLS YRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY 
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 

fck SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

W 

Jjjj Signal sequence. 

r„ s amino acids 1-17 

M< Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

P| amino acids 80-83, 87-90, 172-175 

1 

C| N-rayristoylation sites. 

""^ amino acids 11-16, 47-52, 102-107, 531-536, 565-570 
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FIGURE 16* 

TGGCCTACTGGAAAAAAAAAAAAAAAAAAAAAAAGTCACCCGGGCCCGCGGTGGCCACAA.CATGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAAGTTTGGGCTGGA 
AGTGT TGAACACAGTTTT GGATATT TTCCAAAAGATTTGATCAAGGTACTTCATAAATACACGGA 
AGAAGAG C T AC AT AT T C C AG C AGAT GAG AC AGAC TTTGTCTGCTTT GAAG G AGGAAG AG AT GAT T 
T TAATAGT TATAATGTAGAAGAGCTTTTAGGATC TT TGGAACTGGAGGACTCTGTACCT GAAGAG 
T CGAAGAAAGCTGAAGAAGT T TCTCAGCACAGAGAGAAAT C T CC TGAGGAGT CTCGGGGGCGT GA 
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT 
CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCAGCCTCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
AT TGAAAG T GCCGGGAAGCGAAAGCAGAACTGGCAATAGT T CTCC TGCC TCGGT GGAGCGGGAGA 
AGACAGAT GCTTACAAAGT CCTGAAAACAGAAATGAGT CAGAGAGGAAGTGGACAGTGCGTTATT 
CATTACAGCAAAGGAT TTCGTTGGCATCAAAATCTAAGTTTGTT TTACAAAGAT TGTT TT TAGTA 
CTAAGCTGCCTTGGCAGTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA 
AA&AAAAAAAAAAAAAAAAA 
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FIGURE 166 



MAAAPGLLF^LFVLGALWWVPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF 

Important features of the protein: 
Signal peptide: 

amino acids 1-22 

N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-7 6 

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 

Amidation site. 

amino acids 28-32 
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FIGURE 167 



hi 



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC 
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCACAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTG 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG. 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACCTGACAAAACCCGGG 
ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAACCAACTGCCAATCCTGGC 
O C T T GGGGGTCCCTACC T T TACCAGTGGAAT GATGACAGGTGT AACATGAAGCACAAT TAT AT T T G 

0 CAAGTATGAACCAGAGATTAATCCAACAGCCCCTGTAGAAAAGCCTTATCTTACAAATCAACCAG 
iff GAGACACCCATCAGAAT GT GGTTGT TAC TGAAGCAGGTATAATTCCCAATCTAAT T TAT GT T GT T 

i, i ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
f% GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAACCAGTCTACACTGTGGATTTCAAAGAGTA 
y* CCAGAAAAGAAAGTGGCATGGAAGT ATAA TAACTCAT TGACT TGGT TCCAGAATTTTGTAATTCT 

fck GGATCT GT ATAAGGAAT GGCATCAGAACAATAGC T TGGAAT GGC TT GAAATCACAAAGGAT CT GC 

AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA 
TTTAAAGAATATGCTGTGC'TAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA 
ACT TCAAACT TCAAGCAAATGAAATGGACAATGCAGATAAAGTTGT TAT CAACACGTCGGGAGTA 
W TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA 
III T G TAT AT T G TAT T G AAAT T TAC AG T G T G C AAAAGT AT T T TAC C T T T G C AT AAG T G T T T G AT AAAA 

AT GAAC TGT T C TAATAT TTAT TT T TAT GGCATC TCAT T T T T CAATACAT GCT C T T T TGAT T AAAG 
ffl AAAC T TATTAC TGTTGT CAAC TGAATT CACACACACACAAAT ATAGTAC CATAGAAAAAGT TT GT 

TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA 
y AATAAGAAGCTAT TTCAT TAAGT GTGAT ATAAACCT CCT CAAACAT TT T ACT T AGAGGCAAGGAT 

fl| TGTCTAAT T T CAAT TGT GCAAGACATGT GCCT TATAAT TATT T T TAGCT TAAAAT TAAACAGAT T 

TTG TAATAAT G TAAC T T TGT TAATAGGT GCATAAACAC TAAT GCAGTCAAT T T GAACAAAAGAAG 
TGACATACACAATATAAAT CATATG TCT TCACACGT T GCCTATATAAT GAGAAGCAGC T C TC T GA 
GGGTTCTGAAA'TCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG 
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG 
AC TATATT AGTATACAZVAGAGGTCATGT GGT TGAGACCAGGT GAATAG T CAC TAT CAGT GT GGAG 
ACAAGCACAGCACACAGACAT T T TAGGAAGGAAAGGAACTACGAAATCG TG TGAAAATGGGTTGG 
AACCCATCAGTGATCGCATAT TCAT TGATGAGGGT T T GCT T GAGATAGAAAAT GGTGGC TCC T T T 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 168 



MSRWSLLLGAALLCGHGAFCRRWSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESE 
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ 
YRNWYTDE PS CGS EKCWMYHQPTANPGLGGP YL YQWNDDRCNMKHNYI CKYE PE INPTAPVEK 
PYLTNQPGDT HQNVWTEAG I IPNLI YWI PT I PLLLL I LVAFGTCC FQMLHKS KGRTKTS PNQ 
STLWISKSTRKESGMEV 



Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain: 

amino acids 214-235 



N-glycosylation sites. 



w. 

*y amino acids 86-89, 255-258 



4* cAMP- and cGMP-dependent protein kinase phosphorylation site. 

^ amino acids 2 66-2 69 

f*\ N-myristoylation sites. 

HI amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114, 140- 

C 145, 212-217 
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